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Summary 

The papillomavirus major late protein, LI, forms the 
pentameric assembly unit of the viral shell. Recombi- 
nant HPV16 L1 pentamers assemble in vitro into capsid- 
like structures, and truncation often N-terminal residues 
leads to a homogeneous preparation of 12-pentamer, 
icosahedral particles. X-ray crystallographic analysis 
of these particles at 3.5 A resolution shows that L1 
closely resembles VP1 from polyomaviruses. Surface 
loops contain the sites of sequence variation among 
HPV types and the locations of dominant neutralizing 
epitopes. The ease with which small virus-like parti- 
cles may be obtained from L1 expressed in E. coli 
makes them attractive candidate components of a 
papillomavirus vaccine. Their crystal structure also 
provides a starting point for future vaccine design. 

Introduction 

Human papillomaviruses (HPVs) are pathogens of epi- 
thelial surfaces associated with both benign warts and 
malignant tumors (Howley, 1996; Shah and Howley, 
1 996). Over 90 HPV subtypes have been identified based 
on DNA sequence relationships. Strong epidemiologic 
and biochemical evidence supports association of infec- 
tion by certain high-risk HPV subtypes (e.g., HPV16, 18, 
and 31) with subsequent development of human cervical 
cancer (Pisani et al., 1993; Bosch et al„ 1995). 

Papilloma virions contain two virally encoded pro- 
teins, L1 and L2, synthesized late in the infectious cycle. 
These two proteins encapsidate a histone-associated, 
closed circular, double-stranded DNA minichromo- 
some. The outer shell of the virion contains 72 pentamers 
of LI, centered on the vertices of a T = 7 icosahedral 
lattice (Baker et al., 1991; Trus et al., 1997). L2, a largely 
internal protein, is present at about 1/30 the abundance 
of L1 (Kirnbauer et al., 1993). The overall surface organi- 
zation of the papillomavirus shell resembles that of its 
close relatives, murine polyomavirus and SV40, which 
also contain 72 pentamers of a principal capsid protein, 
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VP1 (Rayment et al., 1982). Papillomavirus particles are 
somewhat larger than polyoma virions (600 A diameter, 
rather than 500 A ), and they contain a correspondingly 
larger genome (8 kb rather than 5 kb) (Howley, 1996). 
Despite the close structural relationship, however, the 
genomic organization is distinct, and there is no discern- 
ible sequence similarity between L1 (about 500 residues) 
and VP1 (about 370 residues) (Belnap et al., 1996). 

Recombinant expression of L1 using vaccinia, bacu- 
lovirus, or yeast systems results in the formation of virus- 
like particles (VLPs), which are assemblies of roughly 
72 L1 pentamers in a shell similar to that of the virion 
(Zhou et al., 1991a; Kirnbauer et al., 1992; Hagensee et 
al., 1993; Rose et al., 1993; Sasagawa et al„ 1995). VLPs 
can induce neutralizing antibodies in inoculated ani- 
mals, and immunization with VLPs can protect experi- 
mental animals from subsequent challenge with infec- 
tious virus (Breitburd et al., 1995; Suzich et al., 1995; 
Christensen et al., 1996). Thus, VLPs appear to be excel- 
lent candidates for papillomavirus vaccines. 

Structural analysis of papillomavirus particles has 
been limited by an inability to grow large amounts of 
virus in culture and by a lack of success in obtaining 
suitable crystals of virions purified from warts. We have 
expressed the HPV16 L1 protein in E. coli and deter- 
mined its crystal structure at 3.5 A resolution. The crys- 
tallization conditions favor formation of a 12-pentamer, 
T = 1 icosahedral assembly of L1, which we describe 
as a "small VLP." Assembly experiments in vitro show 
that a short, N-terminal segment of the LI polypeptide 
chain acts as a switch between 72-pentamer, virion-like 
assemblies and small VLPs. The LI subunits have similar 
folded structures as their polyoma VP1 counterparts, 
but the modes of interpentamer association in polyoma 
virions and papilloma small VLPs are quite distinct. The 
L1 structure can be used to direct engineering of chime- 
ric VLP vaccines (Greenstone et al., 1998), locate domi- 
nant neutralizing epitopes, identify mechanisms of neu- 
tralization by antibodies, and rationalize differences 
among viral serotypes. 



Results and Discussion 

Expression, Crystallization, and Assembly 
We expressed HPV16 LI and various truncated forms 
in E. coli, as glutathione S-transferase (GST) fusion pro- 
teins. Contrary to our expectations based on polyoma 
and SV40 VP1 (Liddington et al., 1991), deletions of up 
to 30 C-terminal residues had little or no effect on the 
stability or solubility of the expressed protein, or on 
the capacity of the appropriate N-terminally truncated 
protein to crystallize; C-terminal deletions longer than 
30 residues rendered the L1 extremely sensitive to pro- 
tease. N-terminal truncation was tolerated up to at least 
residue 13, but only products of constructs lacking 10 
N-terminal residues (LI AN-10) gave crystals suitable 
for structure determination. L1 's with N-terminal dele- 
tions of 15 or more residues were unstable in the fusion 
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Table 1. HPV16 LI Trunca 


tions and Their Assembly Properties 


Trypsin 


Apparent Diameter of 


Deletion Sensitivity 


Assembled Particle (A) a 


AN=0 No 


600 


AN=8 No 


600 


AN=9 No 


600 


AN = 10 No 


300 


AN=15 Yes" 


NA 


AN=20 Yes 


NA 


AC=16 No 


600 


AC = 30 No 


600 


AC=46 Yes 


NA 


AC =86 Yes 


NA 


AN/C-x: A designates deletion, N or C designates N or C terminus, 


and x designates number 


of deleted residues. All the N-terminal 


deletions were confirmed by seven cycles of N -terminal sequencing 


using Edman degradation. All the C-terminal deletions were con- 


firmed by DNA sequencing. Trypsin sensitivity was used a measure 


of correct folding. The pre 


rtease treatment was carried out while 


the GST-fused protein wa: 


5 still bound to glutathione beads (see 


Experimental Procedures). 


Sensitivity was determined by SDS- 


PAGE analysis of the bead 




• Many of the 600 A partick 


;s are imperfect, and there are particles 


of other sizes seen on the 


grid (Figure 1, left). The major species, 


however, is a shell with the approximate size of an HPV virion. 


b Most of the protein with t 


his deletion is sensitive to digestion by 


trypsin, but about 5% resis 


,ts degradation. 


NA, not applicable. 





constructs we used (Table 1), and a form with 13 resi- 
dues removed could not be cleaved from the GST. 

We obtained several crystal forms of L1 AN-10, all 
with unit cells much larger than expected for isolated 
pentamers. The various buffers that yielded crystals had 
in all cases pH less than 6.4, and subsequent studies 
showed that low pH favors assembly of LI into VLPs 
(Figure 1). The size and homogeneity of the observed 
particles depend on N-terminal truncation. L1 penta- 
mers with up to nine N-terminal residues removed yield 
VLPs that resemble 72-pentamer virions (Figure 1, left). 
L1 pentamers with 10 residues removed yield 12-pen- 
tamer, small VLPs (Figure 1, center), and addition of 



even a single glycine to the N terminus of the form 
lacking 10 residues switches assembly back to larger 
shells (Table 1). We were unable to test the effect of 
longer deletions on assembly, for reasons of expression 
yields outlined above and in Table 1. The first ordered 
residue in our structure of a small VLP is Lys-20, and 
we believe that had we been able to prepare suitable 
pentamers with up to about 19 N-terminal residues re- 
moved, we would probably have obtained T = 1 assem- 
blies at low pH. 

The presence of the N-terminal segment of L1 thus 
determines the size of the assembled particles. The 
structure described below shows that the N terminus of 
L1 indeed lies at interpentamer contacts. The assembly 
properties of L1 also determine successful crystalliza- 
tion. All crystals that we obtained were of the 12-pen- 
tamer particle. The 72-pentamer shells, formed by in 
vitro assembly of full-length L1, are too irregular for 
successful crystallization (see Figure 1). 

Structure Determination 

The structure was determined using a bloblike model to 
generate very low-resolution, "molecular replacement" 
phases, which were extended from 31 A to about 1 7 A by 
20-fold noncrystallographic symmetry averaging. These 
phases were then used to locate mercury atoms in a 
derivative, and mercury single isomorphous replace- 
ment and noncrystallographic symmetry refinement 
were used together to complete the phasing at 3.5 A. 
Additional details are given in the Experimental Proce- 
dures. 

Description of the Structure 

The T = 1 icosahedral assembly with 12 pentamers 
(Figure 2) has an outer diameter of 318 A. Within each 
pentamer, the principal domains of the five LI subunits 
associate intimately with each other, making a tightly 
linked ring. The less extensive, interpentameric contacts 
are formed by small, laterally projecting, elbow-like do- 
mains, which cluster around three-fold symmetry axes 




Full length Li. T=7 AN-10, T=1 (In vilro assembly) AN-10, T=1 (from crystal) 



Figure 1. Assembly of Virus-like Particles from HPV16 LI Pentamers, Monitored by Negative Stain Electron Microscopy 
(Left) Particles that resemble 72-pentamer virions assemble at pH 5.2 from full-length L1. Note free pentamers in the background. (Center) 
Small, 12-pentamer VLPs assemble at pH 5.2 from L1 AN-10. Again, note free pentamers. (Right) Small VLPs derived from crystals, dissolved 
in low pH buffer. The staining is somewhat different than in the center panel because of the presence of some PEG, but close inspection 
shows the particles to be of indistinguishable morphology. Bars indicate 600 A in all three panels. 
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(Figure 2). The complete particle thus has a markedly 
grooved appearance, with pentagonal "towers," dec- 
orated with stellate overhangs, rising above a more con- 
tinuous "floor." The shape of these morphological units 
clearly corresponds to the pentamers seen by electron 
cryomicroscopy (Belnap et al., 1996; Trus et al., 1997). 
The LI Monomer 

A classical "jelly roll" p sandwich, similar to the one 
found in polyoma and SV40 (Liddington et al., 1991), 
contains residues 20-382 of the 504-residue L1 polypep- 
tide chain (Figures 3 and 4). The orientation of the long 
axis of the domain is approximately radial. Elaborate 
interstrand loops at one end impart a highly sculpted 
appearance to the outward-facing surface of the pen- 
tamer. The C-terminal lateral projections (residues 383- 
475, shown in yellow in Figure 3) are significantly a 
helical. Three helices (h2-4) form the surface of contact 
with other monomers, while a short strand ((3J) and a 
final helix (h5) anchor the projection back to the jelly 
roll. The strand adds to the C edge of the CHEF sheet, 
and the helix tucks into the axial cavity of the pentamer, 
where it has a hydrophobic interface with the base of 
the BIDG sheet. The last 31 residues are disordered and 
project toward the interior of the particle. This segment, 
the most variable among L1 proteins of different papillo- 
maviruses, is rich in basic residues as well as serine 
and threonine; it probably interacts with DNA in the 
virion, and it includes the LI nuclear localization signal 
(Zhou et al., 1991b). In most of our preparations, some 
of the disordered, C-terminal tails have been cleaved, 
suggesting that they are very susceptible to proteolysis 
before assembly. Recombinant L1, fused at its C termi- 
nus to another small protein and expressed in insect 



cells, can assemble into VLPs (Muller et al., 1997); it is 
likely that the fused element packs loosely into the inte- 
rior of the VLP shell. 
The Pentamer 

The tightest contacts within a pentamer are at higher 
radii; the monomers splay apart toward the particle inte- 
rior. The subunit orientation creates, along the pentamer 
axis, an inward-facing, conical hollow, which opens to 
the exterior of the particle through a narrow throat, about 
14 A in diameter. The polypeptide chain backbones of 
adjacent subunits interact directly. The G strand, at the 
inner margin of the BIDG sheet, "visits" the clockwise 
neighboring monomer and augments its CHEF sheet 
(see Figures 3A and 3C), before returning to rejoin its 
own subunit. The loops are also elaborately intertwined. 
The HI loop of one monomer, extending outward, inserts 
between the FG and EF loops of the anticlockwise neigh- 
bor and reaches far enough to contact the FG loop of 
the next neighbor; thus, there are, around the "top" of 
the pentamer, five overlapping bridges of paired HI:FG 
loops across an intervening subunit. Part of the EF loop 
extends outward to the edge of the pentamer, creating 
the five points of the starlike cap. 
Pentamer-Pentamer Contacts 
The only contacts between pentamers come from the 
small, laterally projecting domains (Figures 3A, 3C, and 
3D). The first two helices (h2, h3) in the projecting domain 
form a V-shaped groove, into which fits the amino-termi- 
nal half of the longer, third helix (h4) from a three-fold 
related subunit. The contact is strongly hydrophobic 
(Figure 5). The third helix (h4) connects to and from the 
rest of the domain through an extended polypeptide 
chain, as if to create an adjustably oriented interaction 
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(A) L 

the five-fold axis of a pentamer. The structure 
includes residues 20-474; the first and last 
residues are labeled N(20) and C(474), re- 
spectively. The strands of the p jelly roll (and 
the short helix, hi, in the EF loop) are blue; 
connecting loops are pink; the G1 strand seg- 
ment from the ar ' 



the CHEF sheet of the sut 
nearest the five-fold); the F strand of the 
clockwise neighbor is green (this is the edge 
of the sheet in the neighboring subunit with 
which the G1 segment forms hydrogen 
bonds); and the laterally projecting C-termi- 
nal domain is yellow. Strands (capital letters) 
and helices (h1-h5) are labeled, and the cor- 
responding sequences can be identified from 
Figure 3. A primed label (F') denotes a seg- 
ment of polypeptide chain from the clockwise 
neighbor within the pentamer; a double- 
primed label (G1", one from the anticlock- 

(B) Polyomavirus VP1 monomer, viewed in 
essentially the same direction as the L1 mono- 
mer in (A). Blue, pink, red, and green have 
the same significance as in the representation 

other pentamer (triple-primed letters), is yel- 
low; the proximal part of the arm of the mono- 
mer in the figure, which extends into the 
neighboring pentamer, is light yellow; distal 
parts of this arm are not shown. 

(C) LI pentamer, viewed normal to the five- 
fold axis. The two monomers in the rear have 
been deleted for clarity. The C-terminal pro- 
jection of the central (blue) monomer is shown 

(D) LI pentamer, viewed along the five-fold 
axis from outside the particle. As in (C), the 
C-terminal projection of the bl 



element. In the virion, with 72 rather than 1 2 pentamers, 
the details of this contact must vary. The linkage of h4 
is likely to be a locus of such flexibility. 

The interpentamer contacts are modest, leaving sig- 
nificant gaps in the vicinity of the two-fold axes (Figure 
2). In virions, and in 72-pentamer VLPs, the correspond- 
ing gaps may be filled by parts of the amino-terminal 
segments, which are partly missing and partly disor- 
dered in the small VLPs. 

Comparison with VP1 of Polymaviruses 
The L1 subunit is clearly a close relative of VP1 of poly- 
omaviruses, despite the absence of recognizable se- 
quence similarity. Both form pentamers with radially ori- 
ented jelly roll domains; both link to each other through 
G strand interchange; both use C-terminal projections 
to form interpentamer contacts and to permit the six 
distinct packing environments for the subunit in the vi- 
rion (Liddington et al„ 1991). The outer surface loops of 
L1 are substantially longer than the correponding loops 
of VP1— especially FG and HI, the two that make contact 
across an intervening monomer— but even the frame- 
works of the jelly roll domains are not readily superim- 
posed. The two sheets formed by CHEF by BIDG of L1 



lie more or less parallel to each other, while those of 
VP1 pack so that the p sandwich opens up toward the 
five-fold axis. Moreover, when the pentamer axes are 
aligned, the L1 jelly roll domain tilts away from the axis 
by 24° relative to VP1, so that the LI pentamer is sub- 
stantially wider at its base. 

The most striking and unexpected difference between 
L1 and VP1 is the nature of the C-terminal, interpentamer 
linkages. The C-terminal arm of polyoma and SV40 VP1 
is extended and flexible when the subunit is part of 
an unassembled pentamer (Stehle and Harrison, 1997; 
Chen et al., 1998). In the shell of the virion, each arm 
invades a neighboring pentamer, where it contributes 
an extra strand (pj) to the B edge of the BIDG sheet of 
one subunit and forms additional, partly Ca 2+ -depen- 
dent contacts to another (Liddington et al., 1 991 ; Stehle 
etal., 1994). Moreover, the p\J strand is clamped in place 
by a further strand (|3A) from an N-terminal arm of the 
receiving subunit. Flexibility— the capacity to form a 72- 
pentamer structure— resides in the freedom of the arm 
to emerge from the jelly roll core in various directions 
and in the ability of the proximal residues of the arm to 
pack against other arms in somewhat variable ways. In 
the T = 1 particles seen in our crystals, the C-terminal 
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Figure 4. Amino Acid Sequences of Papillc 
Alignment of LI sequences from three human papillomavirus types (HPV16, 18, and 11), cottontail rabbit papillomavirus (CRPV), and bovine 
papillomavirus type 1 (BPV1) (PV website, 1997: http://hpv-web.lanl.gov). The secondary structural elements identified from the the HPV16 
LI crystal structure are shown as arrows (strands) and rectangular bars (helices) in the line above the sequences. Blue, pink, and yellow 
colors have the same significance as in Figure 3A. The numbers refer to the sequence of HPV16. 



part (residues 383-474) of the L1 polypeptide chain 
forms a projecting structure that presents acceptor and 
donor interaction surfaces, while remaining anchored to 
the jelly roll domain from which it emanates. Flexibility 
appears to reside in the extended strands that connect 
the donor surface (h4) to the rest of the subunit. 

Control of L1 Assembly 

We can in principle imagine that in the 72-pentamer 
virion, the C-terminal arms actually interchange, rather 
than return to their subunit of origin. In small VLPs, the 
arms of interacting pentamers approach each other near 
the N termini of the h4 helices (Figure 5B), and relatively 
minor local rearrangements could be sufficient to create 
an interchanged structure. The data presented in Table 
1 and Figure 1 indicate, however, that even in solution, 
the recombinant L1 pentamers we express in E. coli 
have anchored C-terminal arms, folded back as in the 
small VLP. C-terminal deletions that include any resi- 
dues in helix h5, which anchors the arm, make the LI 
extremely protease sensitive, indicating that an ordered 
h5 is important for stability (Table 1). Moreover, treat- 
ment of LI pentamers with trypsin leads to a cleavage 
in the C-terminal projection, probably near one end of 
h4, but the short C-terminal fragment thus produced 
remains associated with the rest of the pentamer, pre- 
sumably through tight interactions of h5 (data not 
shown). Full-length recombinant pentamers assemble 



in vitro into virion-like shells (Figure 1, left). The condi- 
tions that lead to this assembly, which are the same as 
those that produce small VLPs from N-terminally trun- 
cated LI (Figure 1, center), are unlikely to promote disso- 
ciation of the h5 anchor from its site on the subunit to 
which it belongs and insertion of the anchor at a related 
site on another pentamer. Thus, we conclude that as- 
sembly of virion-size particles can proceed without arm 
interchange. It is still possible that in vivo assembly in 
the presence of appropriate chaperones, which might 
protect exposed h5, does involve arm interchange. This 
mode of association would be a form of "domain swap, " 
now documented to occur in many cases in addition to 
virus assembly (Schlunegger et al., 1997). This possibil- 
ity can be tested by comparing properties of 72-pen- 
tamer particles assembled in vitro with those derived 
from expression in eukaryotic cells. 

Why does deletion of ten N-terminal residues lead to 
assembly of a 12-pentamer rather than a 72-pentamer 
shell? The first ordered residue in our structure, Lys-20, 
lies near h2 in the C-terminal projection. The N terminus 
of the protein will thus be adjacent to the h3/h4 interpen- 
tamer contact, consistent with its observed effects. The 
amino-terminal, 20-residue segment is also a good can- 
didate to form a structure that can fill interpentamer 
gaps in virions and VLPs (Figure 4). Unlike the largely 
basic, C-terminal disordered segment, the N-terminal 
region of LI is relatively well conserved, both in length 
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Figure 5. Five-Fold and Three-Fold Interactions in the HPV16 LI 
T = 1 Particle 

(A) View along a five-fold axis, showing how helix 4 from each subunit 
projects laterally and rests in a cradle formed by helices 2 and 3 of 
a subunit in an adjacent pentamer. All subunits in the central pen- 
tamer are red; two monomers in each adjacent pentamer are shown 
in yellow and green, respectively. The region surrounded by a 
dashed box is magnified in (B), where the contacts around a three- 
fold axis are displayed in detail. The strands of the p jelly roll in 
each monomer are blue; the C-terminal projections are red, yellow, 
and green, corresponding to the colors of the monomers in (A). 
The hydrophobic side chain contacts between helix 4 and the helix 
2-helix 3 cradle are shown explicitly. 

and in sequence, and the position of residue 20 is pre- 
cisely consistent with a gap-filling function. In SV40 and 
polyoma virions, it is a p hairpin formed by a stretch of 
about 20 residues at the extreme C terminus of the 
polypeptide chain that has this role (Liddington et al„ 
1991). 

Two cysteines, conserved among all papillomavi- 
ruses, have been reported to participate in VLP forma- 
tion and virion stabilization, presumably through an in- 
terpentamer disulfide bond (Li et al., 1998; Sapp et al., 
1998). These residues, 175 and 428 in HPV16, are not 
linked in our structure, and they are too distant from 
each other or from any cysteine in a neighboring pen- 
tamer to engage in disulfide bonds. In the larger, 72- 
pentamer shell, helix h4 (the loosely tethered contact 



helix: Figure 5B) will have a somewhat different orienta- 
tion than in the small VLP, and Cys-428 at its C-terminal 
end may be close enough to Cys-428 of a contacting 
subunit that the two can form an interpentamer disulfide 
bridge. A similar set of disulfides appears to occur in 
SV40, through a cysteine at the base of the CD loop. 

L1 Variability 

We have used a set of LI sequences from 49 different 
HPV types (PV website, 1997: http://hpv-web.lanl.gov) to 
map the correlation of structure and sequence variation. 
Alignment shows that highly variable stretches are inter- 
spersed among segments of conserved residues. Dis- 
play of conservation in three dimensions by color coding 
a model (Figure 6) reveals that all the hypervariable re- 
gions lie on the outward-facing surface of the pentamer. 
The lack of conservation may come simply from drift 
due to weak functional constraints, from negative selec- 
tion by neutralizing antibodies or other components of 
an immune response, or from adaptation to the use 
of different cell surface receptors. Because even the 
subunits' solvent-exposed lateral faces, which lie be- 
neath the overhang of the star-shaped "cap," are far 
less variable than the loops on the top, we favor the 
notion that the changes have been fixed during evolution 
of the various HPV types by interaction with host func- 
tions. There is no evidence for tissue specificity of papil- 
lomavirus receptors (Roden et al., 1996a; Evander et al„ 
1997), and selection for escape from some component 
of the immune system appears to be the most likely 
interpretation of the observed pattern of variability. 

Potential Receptor Sites 

The outer surface of the LI pentamer has five broad 
pockets, created by the BC, EF, and FG loops. While 
the rim of the LI pocket is extremely variable, the floor is 
somewhat more conserved. These characteristics make 
the pockets likely candidates for receptor interaction, 
and indeed the receptor binding pockets on polyomavi- 
rus VP1 are in structurally homologous positions (Stehle 
et al., 1994; Stehle and Harrison, 1996). 

Receptors for papillomaviruses have not been defini- 
tively identified. The presence of a6 integrin on the cell 
surface appears to be required for detectable HPB6b 
VLP binding, and it has been proposed that members of 
this class of integrins participate in HPB6b viral uptake 
(Evander et al„ 1997). Functional receptors are present 
on a broad range of cell types, and bovine and human 
papillomaviruses can enter by the same pathway (Roden 
et al., 1996a). Despite their limited host range and tro- 
pism, papillomaviruses can bind to cells derived from a 
variety of tissues and from various species (Roden et al., 
1994a;Mulleretal., 1995; Volpersetal., 1995). Moreover, 
particles containing bovine papillomavirus type 1 (BPV1) 
genomic DNA encapsidated by LI /L2 from either HPV1 6 
or BPV1 can infect the same mouse cell line (Roden et 
al., 1996a). 

Neutralizing Epitopes 

Antibody-mediated neutralization of papillomaviruses 
appears to have at least two distinct mechanisms. Some 
neutralizing antisera and monoclonal antibodies (mAbs) 
block cell binding, presumably by steric interference 
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Figure 6. Distri 
Highly variable positions are red, full> 
tion, colored as described, of an L1 
pentamer. (C) Interior surface of the 



in 49 HPV Types 

positions are blue, and positions of intermediate variation are white. (A) Wormlike representa- 
viewed normal to the five-fold. (B) Surface representation of a tilted "side view" of an L1 
ihowing conservation of residues (blue) in the conical cavity along the five-fold axis. 



with the receptor site on the virus; others do not prevent 
attachment (and some even bind cell-attached VLPs) 
but probably inhibit uncoating (Roden et al., 1994b; 
Christensen et al„ 1995; Booy et al., 1998). Structural 
analysis by electron cryomicroscopy of BPV1 decorated 
with two different neutralizing mAbs has yielded images 
consistent with these alternatives (Trus et al., 1 997). One 
of the mAbs, which blocks attachment, binds to the 
outer surface of the pentamer; the other, which does 
not interfere with receptor interaction, binds across the 
groove between pentamers and presumably stabilizes 
the particle by cross-linking. 

Neutralizing antibodies against papillomaviruses are 
highly type specific (Roden et al., 1 996b). Epitopes have 
been mapped for neutralizing mAbs raised against 
HPV16 and HPV11 VLPs (Ludmerer et al„ 1996, 1997; 
Roden et al., 1997; White et al., 1999), and Figure 7A 
shows positions on the pentamer surface where muta- 
tions in L1 influence neutralizing-antibody binding. The 
site for the V5 epitope of HPV16 lies near the center of 
the putative receptor pocket, and attachment inhibition 
is a likely neutralization mechanism. It is possible, how- 
ever, that an IgG bound at this position could bridge 
between pentamers in a virion, where the radial diver- 
gence of pentamers is less marked than in the small, 
T = 1 particle. The V5 site is a dominant neutralizing 
epitope on HPV16; the corresponding mAb competes 
with about 75% of all patient antisera (White et al., 1999). 

Within a principal papillomavirus subtype, variants 
arise with minor differences in LI sequence; these iso- 
lates can differ in their affinity for particular neutralizing 
mAbs (Roden et al., 1997). The locations of sequence 
differences in 85 analyzed variants of HPV16 (Figure 7B) 
are all close to the principal neutralizing epitopes (Figure 
7A), consistent with the notion that fixation of these 
sequence variants is driven by escape from neutraliza- 
tion by antibodies in the host. A distinct pattern of muta- 
tions has been observed in 57 variants of HPV5 (Figure 



7C), suggesting that there may be a rather different set 
of dominant neutralization epitopes for different HPV 
types. 

L2 Interactions 

Like VP2/3 of the polyomaviruses, L2 probably interacts 
with an L1 pentamer in the conical hollow that faces 
inward along the five-fold axis (Griffith et al., 1992). The 
crystal structure of a complex of a VP1 pentamer and 
a fragment of VP2/3 shows that the common C-terminal 
segment of VP2/3 inserts in a looplike fashion into the 
axial cavity, so that both the N-terminal part of VP2/3 
and the very C-terminal part extend toward the interior 
of the virion (Chen et al., 1998). Side chains that line 
the corresponding axial cavity in LI are almost totally 
conserved, and even the narrow "throat" through which 
the hollow connects to the exterior contains invariant 
residues (Figures 6B and 6C). The amino-terminal half 
of the L2 polypeptide chain is likewise quite conserved. 
Both extreme amino- and carboxyl termini of L2 are rich 
in positively charged residues, and it is plausible that 
both interact with the viral genome. MAbs to certain L2 
epitopes bind L1/L2 VLPs, however, and some of these 
mAbs are neutralizing (Campo et al., 1997; Kawana et 
al., 1999). The neutralizing epitope has in one case been 
mapped to the vicinity of L2 residue 108 (Kawana et al., 
1999). This segment of L2 might project as a loop from 
the viral shell, either through fenestrations between pen- 
tamers or through the throat along the pentamer five- 
fold axis. Unlike polyomaviruses, where one internal pro- 
tein associates with each VP1 pentamer, the ratio of L2 
chains to L1 pentamers is distinctly substoichiometric. 
One suggestion is that only the 5-coordinated L1 pen- 
tamers contain L2 (Trus et al., 1997), but a more random 
distribution is also possible. 

Vaccine Strategies 

VLP-based vaccines are effective in preventing papillo- 
mavirus infection in animal models (Breitburd et al„ 



Molecular Cell 
564 




Figure 7. Aspects of the Antigenic Structure of L1 

(A) Positions of neutralization epitopes in HPV16 (blue labels) and HPV11 (red labels), shown schematically on a surface representation of the 
pentamer, viewed along the five-fold axis from the "top." Only one set of five-fold related sites is shown. Open triangle, H16.V5; closed triangle, 
H16.E70; star, H11.B2, H11.F1, and H11.G5; closed square, H11.B2 and M1.F1; open square, H11.F1 and H11.G5; open circle, H11.H3. 
Residue 50, which was shown to affect HPV16 L1 binding to H16.E70 and H16.V5, is situated beneath the surface residue 271 of the FG loop 
and is likely to influence the loop structure centered on this residue (Ludmerer et al., 1996, 1997; Roden et al., 1997; White et al., 1999). 

(B) Positions of sequence variation among HPV16 isolates, shown as yellow patches projected onto the outer surface of the L1 pentamer, 

(C) Positions of sequence variation among HPV5 isolates, shown as red patches on the L1 outer surface. 



1995; Suzich et al., 1995; Christensen et al., 1996), and have described has several implications for vaccine de- 
VLP immunogens prepared from yeast and insect cells sign. First, the T = 1 particle itself— in effect, a small 
are now in phase II clinical trials. The L1 structure we VLP— may be a suitable antigen for administration as a 
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vaccine. Presentation of epitopes on individual penta- 
mers is likely to be identical in the two contexts. The 
higher curvature of a small VLP implies that the pen- 
tamer tips are somewhat more widely separated than 
in a 72-pentamer shell, but the distances suggest that 
IgGs could still form bridges, if such interactions were 
important for B cell receptor affinity. The convenience 
of bacterial expression is significant. Second, the analy- 
sis of epitopes and variants in Figure 7 provides a ratio- 
nal approach to cross-protective VLP (or T = 1) vac- 
cines, by grafting segments of other HPV types— or even 
of other proteins— onto the same L1 matrix (Chackerian 
et al„ 1999). Previous efforts to do so have involved 
effectively random trial and error. Third, the structure 
suggests how new interpentamer interactions could be 
introduced to stabilize the T = 1 particles, and through 
model-building, the 72-pentamer VLPs as well. Efforts 
to obtain the structure of a 72-pentamer particle, by 
fitting the results of high-resolution electron cryomicros- 
copy (Trus et al., 1997) and perhaps by crystallizing 
VLPs, are clearly relevant in this regard. Finally, by anal- 
ogy with our previous work on polyomavirus VP1/VP2 
complexes (Chen et al., 1 998), we can now devise strate- 
gies for determining the location of L2 neutralizing epi- 
topes, through studies of appropriate coassemblies. 

Experimental Procedures 

Protein Preparation and Crystallization 

Recombinant HPV16 L1 was expressed in E. coli as a GST fusion 
protein, using the pGEX-2T expression vector, and purified from the 
supernatant of disrupted cells by glutathione-Sepharose chroma- 
tography. As described in greater detail elsewhere (X. S. C. et al., 
unpublished data), it was necessary to add ATP and 3.5 M urea to 
the cell lysate to release tightly bound GroEL. The GST moiety was 
removed by thrombin cleavage; subsequent Superdex chromatog- 
raphy yielded pure LI pentamers. During crystallization trials, only 
the deletion mutant lacking the N-terminal ten amino acids yielded 
good crystals in the presence of 7.5% PEG in 0.2 M NaOAc buffer 
(pH 4.5). We obtained several crystal forms; the space group of the 
form used to determine the structure was P2,3, a = 387 A. 

In Vitro Assembly 

Purified pentamers produced by the corresponding deletion con- 
structs (Table 1) were diluted from 10 mg/ml to 10 fig/ml with buffer 
containing 0.2 M Na acetate (pH 5.2), 1 M NaCI and allowed to stand 
at room temperature for 30 min. Samples were then spotted onto 
glow-discharged, carbon-coated grids and stained with 2% uranyl 
acetate for electron microscopy with a JEOL 100-CX microscope, 
operated at 80 kV. 

Data Collection 

Both native and mercury derivative (methyl mercury hydroxide) data 
sets were recorded at the CHESS F1 beamline with the ACSD Quan- 
tum-4 ccd detector. Native data extended to 3.5 A; derivative data, 
to 5.5 A. The data were indexed and integrated with DENZO (Otwi- 
nowski and Minor, 1997) and further processed with SCALEPACK 
and CCP4 programs (Collaborative Computational Project, 1994; 
version ccp4_3.3). Statistics are in Table 2. The mean l/cr(l) in the 
last resolution bin is about 2. 

Phase Determination 

Initial, very low resolution phases were obtained by molecular re- 
placement with an approximate model, in which the polyomavirus 
VP1 pentamer was used as a five-fold symmetric "lump" of about 
the correct size. Self-rotation functions were used to determine the 
particle orientation (a single parameter in this space group), and a 
single-parameter position search along the crystallography three- 
fold was used to find the particle center. Two internal parameters 



of the particle (pentamer azimuth and radius) were refined against 
the observed data, using structure-factor correlation as a target, 
before the initial phase calculation. Phases at very low resolution 
(70-31 A) were refined and extended to 1 7 A by 20-fold noncrystallo- 
graphic symmetry (NCS) averaging using RAVE (Kleywegt and 
Jones, 1994). This phase set was adequate for locating the 120 
mercury sites in a crystallographic asymmetric unit. Further 20-fold 
NCS averaging and phase extension based on the mercury SIR 
phases yielded a 3.5 A resolution map with clear side chain features, 
into which the L1 model was built using the program 0 (Jones et 
al„ 1991). Details of the structure determination will be published 
elsewhere (Chen and Harrison, unpublished data). 

Rennement 

The initial model had an R factor of 42% (8-3.5 A). Using 20-fold 
NCS restraints throughout the calculations, 300 steps of positional 
refinement in CNS (Brunger et al., 1998) gave R„o, k = 34.5% and 
R,„ = 35.8%. Torsion-angle molecular dynamics refinement low- 
ered these figures to 33.4% and 34.9%, respectively. Further ther- 
mal-parameter (B) refinement together with positional refinement 
using data between 15 and 3.5 A yielded R„^ = 28.3% and R rra = 
29.1% (see Table 2). All residues of the final model have phi/psi 
angles in the allowed regions of a Ramachandran plot (PROCHECK; 
Laskowski et al., 1993), with 72% in the most favored region. 

Illustrations 

The figures were made using O (Jones et al., 1991) (Figure 2), Mol- 
script (Kraulis, 1991) (Figure 3), Ribbons (Carson, 1987) (Figure 5), 
and GRASP (Nicholls et al.. 1991) (Figures 6 and 7). 
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The LI major capsid proteins of human papillomavirus (HPV) types 11 
and 16 were purified and analyzed for structural integrity and in vitro 
self-assembly. Proteins were expressed in Escherichia coli as glutarhione-S- 
transferase-Ll (GST-LI) fusions and purified to near homogeneity as pen- 
tamers (equivalent to viral capsomeres), after thrombin cleavage from the 
GST moiety and removal of tightly associated GroEL protein. Sequences 
at the amino and carboxy termini contributing to formation of LI penta- 
mers and to in vitro capsid assembly were identified by deletion analysis. 
For both HPV11 and HPV16 LI, up to at least ten residues could be 
deleted from the amino terminus (AN10) and 30 residues from the car- 
boxy terminus (AC30) without affecting pentamer formation. The HPV16 
pentamers assembled into relatively regular, 72-pentamer shells ("virus- 
like particles" or VLPs) at low pH, with the exception of HP VI 6 LI 
AN10, which assembled into a 12-pentamer, T = 1 capsid (small VLP) 
under all conditions tested. The production of large quantities of 
assembly-competent LI, using the expression and purification protocol 
described here, has been useful for crystallographic analysis, and will 
be valuable for studies of virus-receptor interactions and potentially for 
vaccine design. 

© 2001 Academic Press 
Keywords: human papillomavirus; virus assembly; virus capsids; protein 
folding 



Introduction 

The Papovaviridae include two virus subfami- 
lies: papilloma- and polyomaviruses. Polyoma- 
viruses are DNA tumor viruses that transform cells 
in tissue culture and cause tumors in experimental 
animals. Papillomaviruses are important human 
pathogens, whose infection is directly related to 
the subsequent development of cervical cancer for 
high-risk subtypes such as human papillomavirus 
(HPV) 16 and 18 (Bosch et ah, 1995; Pisani et al, 
1993). 

Both papilloma- and polyomavirus capsids have 
a similar overall architecture: 360 identical copies 
of their major capsid protein are arranged as 72 
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EM, electron microscopy; rt, room temperture. 

E-mail address of corresponding author: 
xiaojiang.chen@uchsc.edu 



pentamers at the vertices of a T = 7 icosahedral 
lattice (Rayment et al, 1982; Baker et al, 1991). Sur- 
face feature details, as determined by cryoelectron 
microscopic image reconstructions of virions, differ 
noticeably, however (Belnap et al. , 1996; Trus et al, 
1997). Comparison of VP1 proteins among differ- 
ent polyomaviruses (e.g. SV40, murine polyoma) 
and LI proteins among different papillomaviruses 
(e.g. bovine papilloma, HPV) shows that the pri- 
mary sequences are clearly related within the two 
subfamilies, but that there is no detectable 
sequence similarity between Lis and VPls. 

The crystal structures of the VP1 major capsid 
proteins from murine polyomavirus and SV40 are 
known (Liddington et al, 1991; Stehle et al, 1994). 
In addition, the biochemistry of in vitro capsid 
assembly for recombinant VP1 capsomeres has 
been studied in detail (Garcea et al, 1987; Salunke 
et al, 1989). Polyoma VP1 contains a jellyroll 
|3-barrel core, and outside the jellyroll core there 
are long and flexible arms at both amino and 
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carboxy termini. These arms are dispensable for 
pentamer formation, but parts of both are required 
for inter-pentamer interactions and thus essential 
for particle assembly. The positively charged N- 
terminal segment is a DNA-binding domain and 
likely interacts with the viral DNA. The domain 
organization of HPV LI domain differs from that 
of polyoma VP1 in that the positively charged seg- 
ment is at the extreme carboxy terminus (Li et al, 
1997; Zhou et al, 1991), following a sequence of 70 
residues important for inter-pentameric inter- 
actions (Chen et al, 2000). 

Papillomaviruses are difficult to grow in cell cul- 
ture, but the LI proteins have been produced by 
recombinant expression in Escherichia coli and in 
insect cells using baculovirus vectors (Kirnbauer 
et al, mi; Le Cann et al, 1994; Li et al, 1997; Rose 
et al, 1993; Xi & Banks, 1991). We report here a 
purification strategy for obtaining large quantities 
of pure, well-folded HPV11 and HP VI 6 LI penta- 
mers after recombinant expression in E. coli. Using 
deletion mutagenesis and purification of recombi- 
nant mutant LI proteins, sequences at the amino 
and carboxy termini of LI important for LI folding 
and pentamer formation were identified. The 
carboxy-terminal sequences were important for 
particle assembly, and the amino terminus 
appeared to regulate the size of the icosahedral 
shell. 

Results 

Protein expression in E. coli 

The LI proteins from both HPV11 and HPV16 
were analyzed in parallel in order to generalize the 



findings, as well as to utilize minor differences in 
the primary sequences to define domain bound- 
aries. Both proteins behaved similarly, and only 
the differences are specifically discussed. The LI 
coding sequences were cloned into pUC based vec- 
tors under the Ptac promoter for expression in 
E. coli. Using these constructs, the LI protein was 
not detected by SDS-PAGE in whole cell lysates, 
despite efforts to improve expression by varying 
the DNA sequence near the LI start codon, mutat- 
ing several rare arginine codons, or changing bac- 
terial strains (data not shown). The low level of LI 
accumulation was subsequently determined to be 
due, in part, to protein degradation rather than 
low expression levels (data not shown). As an 
alternative approach, we fused glutathione-S- 
transferase (GST) to the LI amino terminus using 
the pGEX-2T vector (Pharmacia). GST-LI fusion 
proteins were detected by SDS-PAGE of whole cell 
lysates (Figure 1, lanes 2-5), indicating accumu- 
lation to high levels. 



Purification of HPV11 and HPV16 L1 

When expressed at 37 °C, the GST-LI fusions 
were found in the insoluble cell pellet. When 
expression was induced at room temperature (rt), 
however, the GST-LI fusions accumulated to high 
level, and partitioned into the soluble fraction of 
the cell lysates. When the soluble fraction was 
chromatographed on glutathione-Sepharose, the 
GST fusions were retained on the column, as 
shown by the 80 kDa species seen by SDS-PAGE 
in Figure 1, lanes 6 and 7. An additional intense 
60 kDa band was present (Figure 1, lanes 6 and 7), 




Figure 1. Purification of HPV11 and HFV16 LI proteins after recombinant expression in E. coli. Shown is an SDS- 
PAGE analysis of GST-LI and thrombin-released LI proteins. Protein standards (lane 1); whole cell lysates induced 
and uninduced for HPV11 GST-LI (lanes 2 and 3) and HPV16 GST-LI (lanes 4 and 5) expression; GST-LI fusions 
bound to glutathione resins for HPV11 (lane 6) and HPV16 (lane 7); glutathione affinity column eluates after throm- 
bin cleavage for HPV11 (lane 8) and HPV16 (lane 9); LI proteins precipitated after ATP-MgCl 2 treatment of the 
eluates in lane 8 (lane 10) and lane 9 (lane 11); glutathione affinity column eluates for HPV11 (lane 12) and HPV16 
GST-LI (lane 13) for preparations using the complete re-folding protocol (see Materials and Methods); eluates after 
thrombin cleavage from glutathione affinity column for HPV11 (lane 14) and HPV16 (lane 15) LI proteins after using 
the re-folding purification protocol. The GST-LI proteins of both virus types have an apparent molecular mass of 
80 kDa (lanes 2, 4, 6, 7, 12, and 13). The 60 kDa species (lanes 6, 7, 8, and 9) is GroEL, as determined by N-terminal 
sequencing. Without using the re-folding protocol (lanes 6 to 9) the GroEL quantitatively co-eluted with LI. After 
using the refolding protocol, the GroEL contamination was almost eliminated (compare lanes 8 and 9 with lanes 14 
and 15). 



Purification and Assembly of HPV1 1 and HPV16 L 1 



which could not be removed by extensive washing 
with various non-denaturing buffers. N-terminal 
sequencing identified the 60 kDa band as GroEL, 
an essential E. coli chaperone protein, which 
together with groES assists the folding of many 
proteins (Chen & Sigler, 1999; Weissman et al, 
1994, 1995). Electron microscopy (EM) analysis also 
detected GroEL double heptamers in the column 
eluate (data not shown; Chen et al, 1994). 

Thrombin digestion of the column-bound GST- 
LI released LI from the GST column, and this LI 
is seen as a band migrating just below the 60 kDa 
GroEL band (Figure 1, lanes 8 and 9). GroEL also 
eluted from the column together with the cleaved 
LI (Figure 1, lanes 8 and 9). The co-eluted GroEL 
could not be separated from LI by gel-filtration or 
ion-exchange chromatography, suggesting a tight 
association. ATP-MgCl 2 releases GroEL from its 
substrate (Lissin et al, 1990; Viitanen et al., 1990). 
When ATP and MgCl 2 were added to the Ll- 
GroEL eluate, a white precipitate formed. SDS- 
PAGE demonstrated that the precipitate was pure 
LI (Figure 1, lanes 10 and 11). LI could be separ- 
ated from GroEL by treating the thrombin eluate 
with 2.5 M urea, but again all the LI released from 
GroEL precipitated. These findings suggested that 
LI was bound to GroEL in a such a way that its 
release led to an insoluble product. 

If LI was bound to GroEL in a "non-native" 
conformation, ATP treatment in the presence of 
GroES should facilitate release of "native" LI pro- 
tein. Therefore, the whole cell lysate, which also 
contains GroES, was treated with ATP and MgCl 2 . 
GST-LI was then purified by glutathione- 
Sepharose chromatography, yielding a GST-LI 
fusion with reduced GroEL contamination (about 
10-20%). A similar reduction in the amount of 
GroEL was observed if the whole cell lysate was 
incubated with 3.5 M urea (see Materials and 
Methods). However, if the cell lysate was first 
treated with ATP and then with 3.5 M urea, the 
level of GroEL co-purifying with GST-LI was 
reduced to approximately 1 % of the total protein 
(Figure 1, lanes 12 and 13), and the yield of LI was 
increased two- threefold. The LI released from the 
column by thrombin digestion was soluble and 
homogeneous (Figure 1, lanes 14 and 15) 
and could be concentrated to approximately 
20 mg/ml- 1 . 

The purified LI protein of HPV11 migrated on 
SDS-PAGE with an apparent molecular mass of 
57 kDa (Figure 1, lane 14). In the case of HPV16 
LI, however, there was a species of approximately 
54 kDa in addition to the 57 kDa band (Figure 1, 
lane 15). The lower-to-upper band ratio increased 
with storage at 4°C, suggesting proteolysis at 
either the amino or carboxy terminus. N-terminal 
sequencing of these two bands demonstrated an 
intact HPV16 LI amino terminus (data not shown), 
indicating proteolysis at the carboxy terminus. 

The purification protocols for the HP VI 6 and 
HPV11 LI proteins are described in detail in 
Materials and Methods. The protocol is the same 



for both HPV11 and 16, except that the highest 
urea concentration for washing the glutathione- 
Sepharose column is 3.0 M for HPV11 LI, but only 
2.3 M for HP VI 6 LI. A urea concentration greater 
than 2.3 M denatured and precipitated the HP VI 6 
protein. A urea concentration of 3.5 M urea is 
optimal, however, for extraction of both HPV11 
and 16 LI from the total cell lysates. Using these 
conditions, approximately 3-5 mg of near homo- 
geneous LI was obtained from one liter of bacterial 
cell culture. 



L1 Characterization 

The purified LI protein was analyzed by FPLC 
gel-filtration chromatography and negative-stain 
EM analysis (data not shown). Superdex-200 chro- 
matography demonstrated a protein peak of about 
300 kDa, consistent with pentamers. EM analysis 
showed typical pentamer "donuts", as previously 
seen for both LI and VP1 purified after recombi- 
nant expression. 

When LI pentamers were digested with trypsin, 
the full-length 57 kDa LI was reduced to a 42 kDa 
species on SDS-PAGE (Figure 2); the trypsin- 
digested pentamers appeared similar to untreated 
pentamers in the electron microscope (Li el al., 
1997). N-terminal sequencing of the tryptic 42 kDa 
products recovered from SDS-PAGE identified an 
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Figure 2. Trypsin digestion of purified HPV11 and 
HPV16 LI proteins. Shown is an SDS-PAGE analysis of 
untreated (lanes 1, 3) and trypsin-digested HPV11 and 
HPV16 LI proteins (lanes 2 and 4). In addition to a 
42 kDa fragment common to both LI types, trypsin 
digestion generated a 9 kDa product for HPV11 LI and 
two species of approximately 4-5 kDa for HPV16 LI. 



176 



Purification and Assembly of HPV1 1 and HPV16 L1 



intact amino terminus for both for both HPV11 
and HPV16 LI. N-terminal sequencing identified 
internal amino terminal sequences for HPV11 LI as 
S 348 -A-T-Y-T-N, and Y 417 -V-Q-S-Q-A (correspond- 
ing to cleavage at K347 and K416), and for HP VI 6 
LI as N 358 -T-N-F-K-E and F 421 -V-T-S-Q-A (corre- 
sponding to cleavage at K357 and K420). A minor 
cleavage site was detected at K434 for HPV11 and 
K436 for HPV16. 

When trypsin-digested LI pentamers, presum- 
ably containing the 42 kDa fragments, were 
analyzed by Superdex-200 FPLC (15/60 column) 
they eluted at a position with an apparent molecu- 
lar mass of 300 kDa, similar to that of the full-length 
LI. High-resolution SDS-PAGE analysis of the pen- 
tamer peak of trypsin-digested LI from the Super- 
dex-200 gel-filtration showed the 42 kDa band, and 



an additional 9 kDa band for HPV11 (Figure 2, lane 
2), and two other bands, of approximately 4 and 
5 kDa for HP VI 6 (Figure 2, lane 4). Thus the smal- 
ler tryptic carboxy-terminal fragments remained 
associated with the larger 42 kDa fragment during 
gel-filtration chromatography, implying that the 
carboxy-terminal tryptic fragments are integral 
parts of the LI pentamer structure. 



Carboxy-terminal sequences required for L1 
pentamer formation 

To determine the sequences at the carboxy termi- 
nus necessary for proper LI folding, we generated 
a series of carboxy-terminal deletions for both 
HPV11 and HPV16 LI. HPV11 LI deletions were 
made as intein fusions to the LI carboxy terminus, 
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Figure 3. Pentamerization of carboxy-terminal deletion mutants of LI proteins. The presence of a 42 kDa fragment 
after trypsin digestion was used as the criterion for proper folding and pentamer formation, (a) HPV11 LI deletions 
were expressed as Ll-intein fusions and whole cell lysates (incubated for 15 minutes with ATP-MgCl 2 ) were partially 
purified by chitin afffinity chromatography. SDS-PAGE analysis shows the LI proteins (cleaved from the Ll-intein 
fusion by incubating the chitin resin with 20 mM DTT overnight followed by elution with three bed volumes of buf- 
fer L) before and after trypsin digestion. LI was not well separated from GroEL (lanes 1, 3, 5, and 7). LI degradation 
to smaller species was seen for AC33 and longer deletions. After trypsin digestion, only AC26 yielded the 42 kDa 
fragment (lanes 2, 4, 6, and 8). The bands remaining at the top of the gel are GroEL. (b) HP VI 6 LI deletions were 
expressed as GST-LI fusions and whole cell lysates (incubated for 15 minutes with ATP-MgCl 2 ) were partially puri- 
fied by glutathione affinity chromatography. The protease was added directly to the GST-LI fusions bound to gluta- 
thione-Sepharose. SDS-PAGE analysis shows the LI proteins (as GST-LI fusions on the resin) before and after trypsin 
treatment. Before trypsin digestion, the amounts of intact GST-LI for AC46, AC86 and AC109 (lanes 7, 9, and 11) 
were decreased in comparison with AC16 and AC30 (lanes 2 and 5). Lane 3 was AC16 treated with thrombin. After 
trypsin digestion, although free GST was seen in all cases (lanes 4, 6, 8, 10, and 12), the 42 kDa product was seen 
only for AC16 and AC30. The 60 kDa GroEL remained unchanged. GST bands after trypsin digestion were more 
intense than the intact GST-LI bands for AC46, AC86 and AC109, suggesting the presence of non-intact GST-LI 
fusions. 
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while HPV16 LI truncations were made as GST 
fusions to the LI amino terminus (Figure 3). The 
criteria for correct pentamer oligomerization and 
folding was the presence of the 42 kDa trypsin- 
resistant species after trypsin digestion of the puri- 
fied deleted proteins. 

The HPV11 Ll-intein constructs included AC26 
(26 residues deleted), AC33, AC59 and AC115 
(Figure 3(a)). The expressed fusion proteins of each 
construct were partially purified by chitin affinity 
chromatography (Figure 3(a)), and the column elu- 
ates containing both GroEL and LI were digested 
with trypsin and analyzed by SDS-PAGE 
(Figure 3). Trypsin digestion of AC26 yielded the 
42 kDa trypsin-resistant band on SDS-PAGE 
(Figure 3(a), lane 2), indicating proper folding. For 
the other three carboxy-terminal deletions, how- 
ever, the 42 kDa band disappeared after trypsin 
digestion (Figure 3(a), lanes 3-8), indicating that 
the products of these deletions were in a non- 
native conformation. EM analysis of these proteins 
demonstrated the presence of pentamers only for 
the AC26 protein (data not shown), consistent with 
the trypsin digestion results. 

Carboxy-terminal deletions of the HPV16 GST- 
LI fusion behaved similarly to the HPV11 Ll-intein 
fusions. The HPV16 GST-LI deletions included 
AC16, AC30, AC46, AC86 and AC109 (Figure 3(b)). 
SDS-PAGE analysis of the glutathione-Sepharose 
before addition of trypsin showed the presence of 
both the GST-LI deletion fusion proteins and 
GroEL (Figure 3(b), lanes 2, 5, 7, 9 and 11). Trypsin 
digestion of the AC16 and AC30 fusions yielded 
the 42 kDa band (Figure 3(b), lanes 4 and 6), AC46 
and longer deletions of LI were degraded comple- 
tely (Figure 3(b), lanes 7-12). Again, EM analysis 
revealed that only AC16 and AC30 had a penta- 
mer appearance. Only a single band was present 
on SDS-PAGE for the HPV16 AC30 protein after 
purification and thrombin cleavage (data not 
shown), corresponding to the lower band of the 
doublet for full-length LI (Figure 1, lane 15). This 
result implies that a protease-sensitive site on full- 
length HPV16 LI lies between the carboxy-terminal 
26-30 residues, where there are indeed three lysine 
residues. 



Amino-terminal sequences required for L1 
pentamer formation 

The N-terminal sequencing results showed that 
the LI amino terminus is unaffected by trypsin 
digestion. In order to define the role of amino 
terminus region in LI folding and pentamer for- 
mation, we made amino-terminal deletions in both 
HPV11 and HPV16 LI (Figure 4). For Ll-intein 
fusions of HPV11, AN5, AN8, AN14 and AN19 
were expressed, the fusion proteins partially puri- 
fied (Figure 4(a)) and subjected to trypsin diges- 
tion. The trypsin-resistant 42 kDa band was 
present for AN5 and AN8 (Figure 4(a), lanes 1, 2 
and 5, 6), faintly present for AN14, and almost 
absent for AN19 (Figure 4(a), lanes 3, 4 and 7, 8). 
This result suggests that AN5 and AN8 proteins 
fold into trypsin-resistant pentamers, whereas 
longer deletions result in either only small amounts 
of the folded pentamers (AN14) or none at all 
(AN19). 

For HPV16 GST-LI, the deletion constructs are 
shown in Figure 4(b). Trypsin digestion of the par- 
tially purified GST-LI yielded the 42 kDa trypsin- 
resistant band for AN8, AN10 and to some extent 
for AN13 (Figure 4(b), lanes 1, 2, 3 and 5, 6, 7). The 
GST-LI fusion of AN13 was not efficiently 
digested by trypsin (Figure 4(b), lanes 3 and 7); 
some full-length GST-LI fusion was present even 
after prolonged incubation with trypsin. For AN20 
(and AN30, not shown), the GST-LI fusion pro- 
teins and GroEL were not detected by SDS-PAGE 
(Figure 4(b), lane 4). Instead, a broad band with 
the mobility of GST was present (Figure 4(b), lanes 
4 and 8). It is possible that the LI moieties of these 
GST-LI fusions were degraded after expression but 
before purification. 



Effect of deletions on in vitro capsid assembly 

HP VI 6 LI deleted proteins capable of pentamer 
formation were tested for in vitro self-assembly 
into higher-order structures (Table 1). All the 
amino-terminal and carboxy-terminal deletions 
except AN10 assembled into T = 7 particles under 
the conditions used (Figure 5(a)), but AN10 



Table 1. The assembly of HPV16 LI deletion mutants under different pH values 
Size of assembled particles 



Constructs pH 4.0 pH 5.4 pH 6.2 ~ pH 7.5 pH 8.5 



A N-0 


T7 


T7 


T7 


T7/pent 


pent/T7 


A 


NT 


T7 


T7 


T7/pent 


NT 


A N-6 


NT 


T7 


T7 


T7/pent 


NT 


A N-8 


NT 


T7 


T7 


T7/pent 


pent/T7 


A N-10 


Tl 


Tl 


Tl 


Tl 




A N-10 + Gly 


T7 


T7 


T7 


NT 


NT 


A N-10 + Thr(AN-9) 


T7 


T7 


T7 


NT 


NT 


A C-16 


T7 


T7 


T7 


NT 


NT 


A C-30 


T7 


T7 


T7 


NT 


NT 



Assembly properties of different HPV16 LI truncation mutants at different pH. The assembly buffers were: pH 4.0 and 5.4, 
40 mM sodium acetate; pH 6.2 and 7.5, 40 mM Hepes; pH 8.5, 40 mM Tris-HCl; all contained 1 M Nacl. NT, not tested. 
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Figure 4. Pentamerization of ammo-terminal deletion mutants of LI proteins. The presence of a 42 kDa fragment 
after trypsin digestion was used as a criterion for proper folding and pentamer formation, (a) HPV11 LI deletions 
were expressed as Ll-intein fusions and whole cell lysates (incubated for 15 minutes with ATP-MgCl 2 ) were partially 
purified by chitin affinity chromatography. SDS-PAGE analysis shows the LI proteins (as intein-Ll fusions on the 
resin) before and after trypsin treatment. Before trypsin digestion, full-length fusions are seen at the top of the gel for 
AN5 and AN8 (lanes 1 and 2). This band intensity decreases for AN14 (lane 3) and almost disappears for AN19 
(lane 4). There is an intense band at the position of 60 kDa for each lane, which contains mostly intein-CBP (Chitin 
Binding Protein) and a small amount of GroEL (confirmed by N-terrrunal sequencing). The free intein-CBP implies 
extensive proteolysis of the fusion between LI and intein. The cleaved LI (together with some GroEL) does not bind 
chitin beads during purification. After trypsin treatment, the 42 kDa band is present for AN5 and AN8 (lanes 5 and 
6), to some extent for AN14 (lane 7), but absent for AN19 (lane 8). Free chitin-CBP is digested to smaller fragments 
(lanes 5 and 8). (b) HPV16 LI deletions were expressed as GST-LI fusions, and whole cell lysates (incubated for 15 
minutes with ATP-MgCl 2 ) were partially purified by glutathione affinity chromatography. SDS-PAGE shows the LI 
proteins (as GST-LI fusions on the resin) before and after trypsin treatment. Before trypsin digestion, full-length 
fusions for AN8, AN10, and AN13 migrate at the top of the gel (lanes 1, 2, and 3). Only free GST is present for 
AN20, possibly due to proteolytic degradation of the LI portion during purification (lane 4). After trypsin digestion, 
the 42 kDa fragment is present for AN5 and AN10 (lanes 5 and 6). AN13 shows some 42 kDa and undigested fusion 
proteins (lane 7). 



assembled into T = 1 particles under all conditions 
tested (Figure 5(b)). The precise deletion was criti- 
cal. When even one Gly or Thr residue (the tenth 
LI residue) was added to the amino terminus of 
AN10 (AN10 + Gly or AN10 + Thr), the proteins 
assembled into T-7 particles (Figure 5, (c) and 
(d)). 

The appearance of assembled products for each 
construct were generally the same in buffers with 
different pH (Table 1). The most consistent results 
were obtained at pH 5.4. Clumping of pentamers 
tended to become more severe at high pH, possibly 
due to disulfide bond formation of the free cysteine 
residues, and oligomeric pentamers (aggregates) 
were the predominant species at pH 8.5. 

Recently the AN10 deletion of HPV16 LI was 
crystallized as a T = 1 particle, and its X-ray struc- 
ture determined (Chen et al, 2000). The structure 
showed that the carboxy terminal segment from 
residue 384-446 folds into three helices with con- 
necting loops and turns (Figure 6). These helices 



constitute the primary interpentamer bonding con- 
tacts in the assembled T = 1 particle. To test 
whether these helices also affect particle assembly, 
LI proteins were generated with a specific deletion 
of helix 4 (see the structure in Figure 6) for both 
HP VI 6 (residues 408-431), and HPVll(residues 
409-429). Pentamers were purified for these deleted 
proteins, as demonstrated by their FPLC elution 
profile and a donut appearance seen by EM. No 
assembly of particles from these pentamers was 
found under any condition tested (Figure 5(e)), 
suggesting that this carboxy-teirninal helical 
domain is essential for particle assembly. 



Discussion 

Previous efforts to express HPV LI in E. coli 
have produced only modest quantities of 
assembly-competent pentamers. The protocol 
described here gives a substantially enhanced yield 
of pure LI, using a relatively straightforward 
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Figure 5. In vitro self-assembly of carboxy and amino-terminal deleted HPV16 LI proteins. Electron micrographs of 
assembly reactions conducted in 1 M NaCl and 40 mM sodium acetate (pH 5.4), for purified LI proteins. The bar 
length represents 600 A, the diameter of a papillomavirus virion, (a) T = 7 particles (with diameter of about 600 A) 
assembled from AN0 (full-length LI), with many unassembled pentamers in the background, (b) T=l particles 
(diameter of about 300 A) assembled from AN10. (c) and (d) show the assembly products for AN10 + Gly and 
AN10 + Thr. The particles in (c) and (d) have a diameter of approximately 600 A (T = 7), like those of AN0. (e) Free 
pentamers are the only products of in vitro assembly for A409-431L1. 



purification strategy. Two factors appear to have 
been critical: (1) the use of GST-LI fusion proteins, 
which greatly facilitated purification; and (2) a 
purification strategy that separated native LI from 
the GroEL-bound complex. Despite the 27 kDa 
amino-terminal addition of the GST moiety, the 
fusion polypeptides fold and form pentamers 
properly, as evidenced by the release of LI 
pentamers from the fusions by thrombin and their 
ability to be crystallized (Chen et <xl, 2000). 

GroEL release was accomplished by adding 
ATP-MgCl 2 to the cell lysate, followed by treat- 
ment with 3.5 M urea. We observed that 3.5 M 



urea alone released about half of the GroEL. EM 
analysis demonstrated that GroEL double hepta- 
mers began to disassemble into single subunits in 
2.0 M urea and disassembled completely in 3.5 M 
urea, while LI pentamers withstood these urea 
concentrations. ATP-MgCl 2 alone also released 
about half of the GroEL. Neither urea nor ATP 
could be used used to release GroEL from the puri- 
fied Ll-GroEL complex. Factors in the cell lysate 
presumably participated in the success of both 
treatments. In the case of ATP treatment, these fac- 
tors may have included GroES, which is required 
for normal ATP-dependent release of proteins from 




Figure 6. Three-dimensional 
structure of the HPV16 LI 
monomer (Chen el «/., 2000). The 
orientation in (b) is rotated 
approximately 120 ° about the verti- 
cal axis from the orientation in (a). 
The B-srrands are labeled by letter 
B through J. The starting residues 
of most B-srrands and the starting 
and ending residue numbers for 
the carboxy-terminal a-helices are 
identified. Trypsin-sensitive sites 
are marked by arrows. Deletion of 
h4 (helix 4) of both HPV11 and 
HPV16 LI abolished assembly of 
particles (see Figure 5 (e)). 
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GroEL, but why the urea treatment was unsuccess- 
ful when applied later in the purification is less 
evident. 

How can we picture the GroEL/GST-Ll com- 
plex? The staining ratio in Figure 1 indicates about 
three GroEL subunits per GST-LI, or about one 
GroEL double heptamer per LI pentamer. Since 
there is substantially less than one GroEL double 
heptamer per LI monomer, the LI must be largely 
folded and assembled, even with GroEL attached. 
The release of the GroEL by treatment with ATP- 
MgCl 2 in the presence of factors from the E. coli 
lysate may therefore proceed as a "normal" 
completion of the GroE cycle. Successful use of 
urea to dissociate GroEL and to produce assembly- 
competent LI pentamers would require that the 
part of LI interacting with GroEL be able to fold 
spontaneously once released. 

The preparation of truncated forms of HPV11 
and HPV16 LI allowed further definition of the LI 
domains required for pentamer and particle for- 
mation beyond that previously reported (Chen 
et al, 2000). Up to 30 residues can be deleted from 
the carboxy terminus and at least ten residues 
from the amino terminus, without affecting penta- 
mer formation. The ammo-terminal 20 residues 
and the carboxy-terminal 30 residues are disor- 
dered in the crystal structure, and the deletion 
analysis thus shows that residues not part of the 
core of LI (which we define as residues 21-474 for 
HPV16 and 19-471 for HPV11) are also not essen- 
tial for pentamer assembly. A similar observation 
has been reported for polyomavirus VP1, where a 
protein with 32 residues deleted at the amino 
terminus and 63 residues deleted at the carboxy 
terminus can still form a pentamer amenable to 
crystallization and structure analysis (Stehle et al., 
1994). As in the case of LI, deletions compatible 
with pentamer assembly are ones that leave the 
protein core intact. 

The carboxy-terminal sequences of LI from 
different papillomaviruses are particularly variable. 
For example, the similarity of HPV16 and HPV11 
LI sequences ceases abruptly 30 residues from the 
carboxy terminus, just C-terminal to helix h5 (Chen 
et al., 2000). Immediately preceding this boundary 
in the sequence are the residues that participate in 
the primary interpentameric contacts. Helices h2, 
h3, and h4 (see Figure 6) constitute a small subdo- 
main that projects laterally from the (3-jelly roll of 
LI, and helix h5 anchors this interaction domain 
by inserting back into the central cavity of the LI 
pentamer. Data presented here, and related results 
in the literature, demonstrate that an intact h5 is 
essential for pentamer assembly as well as for sub- 
sequent capsid formation. For example, BPV LI 
forms capsids with a 24 residue carboxy-terminal 
deletion, but not with a 40 residue deletion 
(Paintsil et al, 1996); likewise, stable folding of 
canine oral papillomavirus (COPV) LI tolerates a 
26 residue, but not a 67 residue, deletion (Chen 
et al., 1998). Our finding, that deletion of even 
three residues beyond the 30 residue boundary in 



HPV11 LI (HPV11-L1 AC33) produces an 
unfolded, trypsin-sensitive endproduct, demon- 
strates the importance of the complete LI core for 
pentamer stability, including the helical subdomain 
and its h5 anchor. Our observation that the 42 kDa 
fragment obtained by limited proteolysis is not an 
independently stable entity leads to the same 
conclusion. 

Because the helical subdomain of LI is necessary 
for the integrity and stability of the pentamer itself, 
in addition to being essential for interactions 
among pentamers in the capsid, we presume that 
it adopts its folded structure prior to viral assem- 
bly. That is, a carboxy-terminal segment does not 
exit the LI monomer to form a helix 5 contact with 
an LI in a neighboring pentamer. In contrast, the 
entire, 63 residue, carboxy-terminal arm of polyo- 
mavirus VP1 is disordered on the subunits of free 
pentamers, and it folds into ordered structures 
only when it "invades" a neighboring pentamer as 
capsids assemble (Liddington et al, 1991). 

How does truncation of the amino terminus of 
LI affect the design of the assembled particle? In a 
T = 1 small VLP, all of the carboxy-terminal inter- 
action domains contact neighbors in identical, sym- 
metrically related ways. In the 72 pentamer shells 
of virions, there must be some variability in the 
way these domains interact. Moreover, the curva- 
ture of the virion shell is much smaller than that of 
the small VLP, so the carboxy-terminal domains 
must face their interaction partners at different 
angles from that found in the T = 1 particles. The 
ammo-terminal residues of the polypeptide chain 
lie immediately adjacent to the carboxy-terminal 
interaction domain, and it is reasonable to postu- 
late that their presence prevents the high-curvature 
interaction seen in the small VLPs. A structure for 
the 72 pentamer shell would be required to under- 
stand precisely why the switch to small-shell 
assembly occurs so abruptly with deletion of the 
tenth amino-terminal residue. 

In summary, the protocol described in detail in 
this paper permits expression and purification of 
large quantities of HPV LI proteins. We have 
already used this procedure to determine the struc- 
ture of HPV 16 LI, and we anticipate that the same 
methods will be useful for crystallizing LI proteins 
from additional HPV types. The resulting penta- 
mers and in vitro assembled particles will provide 
reagents for co-crystallization efforts, new sub- 
strates for immunologic studies, and potential new 
vaccine immunogens. 



Materials and Methods 

Cloning and deletion constructs 

LI clones of HPV 11 from human laryngeal (accession 
no. M14119) and of an HPV16 strain from human cervi- 
cal carcinoma (accession no. AF140325) were used for 
this study. LI PCR products were inserted downstream 
of GST coding sequence of pGEX-2T vector (Pharmacia) 
to generate GST fusions at the amino terminus of LI. For 
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HPV11 LI, primer 1 (forward) 5' ACCTAC AGATCT 
GGT ATGTGGCGTCCTAGCGACAGC and primer 2 
(reversal) 5' CA GAATCC TTACTTTTTGGTTTTGG- 
TACGTT were used for amplifying the LI coding 
sequence and the PCR fragment were digested with 
Bgl II/EcoR I for inserting into BamHI/EcoRI-digested 
pGEX2T vector. The primers for HPV16 LI, forward 5' 
CTGA AGATCT GGT ATG TCC CTG TGG CTG CCT 
AGT GAG GCC ACT GTC and reversal 5' CAGTTC 
GATATC TTAACGCTTACGTTTTTTGCGTTTAGC were 
used for PCR and the PCR product was digested with 
Bglll/EcoRV for inserting into BamHl/Smal sites of 
pGEX2T. 

The carboxy-terminal fusion constructs of HPV11 LI 
with intein, were generated by cloning LI PCR frag- 
ments into the pCYBl vector (IMPACT I system, NEB 
Biolabs). The primers for PCR cloning were: forward 
5' CAATGTCA CATATG TGGCGTCCATCTGAT 
AGCAC TGTATACGTCCACCTCCAAACCCTGTATCC 
an d reve rsal 5' AGACTT GCTCTTCC GCAGTTTTT 
GGTTTTGGTACGTTTTCG. The PCR product was 
digested with Ndel/Sapl and then ligated into Ndel/Sapl- 
digested pCYBl vector. E. coli strain XA90 [AlacproXIII, 
nalA, argE(am), thi, rifr/F'lacP'ZY.prbAB] was used for all 
the cloning and expression studies. This strain consist- 
ently expressed LI fusions to higher level than other 
strains tested. 

Deletion mutants were constructed using PCR based 
on the full-length LI fusion clones. Sets of two primers 
were used for PCR to amplify the whole plasmid con- 
taining LI gene except for the nucleotides to be deleted. 
The PCR fragment contained a new unique restriction 
site on both ends (incorporated into the primers) for 
digestion and ligation. Clones were first identified by 
digestion of the newly generated site and then further 
confirmed by DNA sequencing. This strategy proved to 
be a very efficient deletion method. Plasmids of 6-8 kb 
were routinely amplified by PCR using the following 
conditions: 94 °C for 1.5 minutes, 65 °C for 1 minute, 
68 °C 12 minutes, two cycles of PCR, then 94 °C for 1.5 
minutes, 68 °C for 13 minutes, 23 cycles of PCR. The 
reaction contained five units of Taq polymerase and 0.25 
unit of Phi (Strategen). The inclusion of Pfu at the con- 
centration of 1/20 Taq units in the PCR reaction was 
essential to obtain large quantities of full-length PCR 
products for cloning. 



Protein Purification 

The detailed protocols of protein expression and puri- 
fication of both HPV11 and HPV16 LI are described as 
follows. A 2 ml overnight cell culture was inoculated to 
one liter of 2x YT at 37 °C. Cells were grown until] 
^600 = 0.1-0.3. The flasks were transferred to 25 °C and 
incubated with shaking for 30 minutes. IPTG was added 
to 0.2 mM and protein expression induced for 16-20 
hours. Cells from a one liter culture were resuspended in 
30 ml of buffer L (50 mM Tris-HCl (pH 8.0), 0.2 M NaCl, 
1 mM DTT, 1 mM EDTA) and lysed by sonication. After 
sonication, ATP and MgCl 2 were added to final concen- 
trations of 2 mM and 5 mM, respectively. After one hour 
incubation at room temperture, urea (ultrapure grade) 
was slowly added to the lysate to a final concentration 
of 3.5 M. The mixture was incubated at room tempera- 
ture for one hour with gentle shaking, and then dialyzed 
against three changes of buffer over 18 hours. The lysate 
was clarified by centrifugation at 25,000 g for 75 minutes. 
The GST-LI (or LI -intein) in the supernatant was 



purified using a glutathione affinity column (or chitin 
chromatography) using 15 ml bed volume for a lysate 
from eight liters of bacteria. After washing the column 
with ten bed volumes of buffer L, residual GroEL was 
eluted by washing with ten bed volumes of 3.0 M urea 
(in buffer L) for HPV11 LI or 2.3 M urea for HPV16 LI. 
LI was cleaved from the GST fusion using thrombin 
(Sigma, T6634) in an approximate ratio of 100 ug GST-LI 
to one NIH unit of enzyme. The digestion was carried 
out at 4 °C overnight. LI was further purified by Super- 
dex-200 (16/60 column) gel-filtration FPLC. Ll-intein 
fusions were cleaved by incubation with DTT at 4°C 
overnight (IMPACT I system of New England BioLabs). 

Trypsin Digestion and In vitro Assembly 

Limited trypsin digestion of LI (trypsin to LI ratio of 
1:1000) was performed at room temperature, and the 
digestion was terminated by adding 1 mM PMSF and an 
equal volume of 2x SDS sample buffer followed by 
immediate boiling. Assembly assays were conducted 
using two methods. First, LI protein was dialyzed (at a 
concentration of approximately 0.1 mg/ml) against the 
assembly buffer in a micro-dialysis button (Pierce) for 60 
minutes. Second, concentrated LI protein (10-20 mg/ml) 
was diluted 100-1000-fold directly into assembly buffer 
and incubated at room temperature for 30 minutes. Both 
methods gave equivalent results. For EM analysis, the LI 
protein was spotted on glow-discharged, carbon-coated 
grids (EM Science) and stained with 2% (w/v) uranyl 
acetate. A JEOL 100-CX electron microscope was used 
for visualization. 
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The LI major capsid protein of human papillomavirus type 11 (HPV-11) was expressed in Escherichia coli, 
and the soluble recombinant protein was purified to near homogeneity. The recombinant LI protein bound 
DNA as determined by the Southwestern assay method, and recombinant mutant LI proteins localized the 
DNA-binding domain to the carboxy-terminal 11 amino acids of LI. Trypsin digestion of the full-length LI 
protein yielded a discrete 42-kOa product (trpLl), determined by sodium dodecyl sulfate-polyacrylamide gel 
electrophoresis, resulting from cleavage at R415, 86 amino acids from the LI carboxy terminus. Sucrose 
gradient sedimentation analysis demonstrated that trpLl sedimented at US, while LI proteins with amino- 
terminal deletions of 29 and 61 residues sedimented at 4S. Electron microscopy showed that the full-length LI 
protein appeared as pentameric capsomeres which self-assembled into capsid-like particles. The trpLl protein 
also had a pentameric morphology but was unable to assemble further. In an enzyme-linked immunosorbent 
assay, the trpLl and LI capsids reacted indistinguishably from virus-like particles purified after expression of 
HPV-11 LI in insect cells. The carboxy terminus of LI therefore constitutes the interpentamer linker arm 
responsible for HPV-11 capsid formation, much like the carboxy-terminal domain of the polyomavirus VP1 
protein. The trypsin susceptibility of HPV-11 LI capsids suggests a possible mechanism for virion disassembly. 



Papillomaviruses are a family of nonenveloped, double- 
stranded DNA viruses which infect many species, and at this 
time more than 70 different types of human papillomaviruses 
(HPVs) have been identified. Image analysis of cryoelectron 
micrographs of bovine papillomavirus type 1 (BPV-1) and 
HPV-1 has demonstrated a common structure comprised of 72 
pentameric capsomeres arranged on a T=7 icosahedral capsid 
lattice (2). This overall structure is similar to that previously 
described for simian virus 40 (SV40) and murine polyomavirus 
(15, 27), although possible differences in capsomere morphol- 
ogy and intercapsomere contacts are suggested by the image 
reconstruction. The papillomavirus genome encodes two struc- 
tural proteins, LI and L2, synthesized late in infection. In the 
virion, the ratio of LI (55 kDa) to L2 (74 kDa) has been 
estimated over a range from 5:1 to 30:1 (26a). By analogy to 
polyomavirus and SV40, the capsomeres are likely formed by 
pentamers of LI. L2 may associate with the LI pentamer, 
although given the uncertain ratio of LI to L2, all or a subset 
of the pentamers may be occupied. 

Papillomaviruses are difficult to propagate in vitro. There- 
fore, recombinant expression systems have been used to ex- 
press the LI and L2 proteins in order to characterize their 
immunologic characteristics as well as to analyze their struc- 
tural properties. Capsid proteins for HPV-16, BPV-1, HPV-11, 
and HPV-33 have been expressed in insect cells with baculo- 
virus vectors (12, 23, 28), and those for HPV-1, HPV-16, and 
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BPV-1 have been expressed in mammalian cells with vaccinia 
virus (10, 30). These expression systems all generate virus-like 
particles (VLPs) similar in appearance to empty capsids, and 
these VLPs have immunologic characteristics suggesting a na- 
tive conformation. When cells expressing LI are examined by 
electron microscopy, the VLPs are seen exclusively in the nu- 
clei of the cells (12, 23, 30). These results are also analogous to 
observations with polyomavirus (17), suggesting not only com- 
mon structural features between these virus families but also 
similar control of the intracellular site of capsid assembly. 

The polyomavirus major capsid protein VP1, purified after 
expression in Escherichia coli, self-assembles in vitro into cap- 
sid-like structures (25). This self-assembly reaction has pro- 
vided a useful method for assaying conditions favoring capsid 
assembly as well as for examining mutant VP1 proteins for 
their ability to assemble (8, 26). In order to analyze the assem- 
bly properties of LI, we have purified the HPV-11 LI protein 
after expression in E. coli. This recombinant protein also self- 
assembled in vitro into capsid-like particles; however, trunca- 
tions of the LI protein revealed a domain structure that differs 
from that of the polyomavirus VP1 protein. 

MATERIALS AND METHODS 
Plasmid constructions. The full-length HPV-11 LI DNA sequence was ob- 
tained by PCR amplication from the plasmid pVLULl (23), with a forward 
primer, 5'-GCCGCGAAGCT TCATATG TGGCGGCCTAGCGACAG. con- 
reverse d primer, ^-GGGCCTOGATCCAC^ 

AC, containing a Bg/II restriction enzyme site at the 3' terminal sequence. The 
PCR-amplified fragment was purified with a Geneclean 11 Kit (Bio 101) and then 
subcloned into a PCR II vector (Invitrogen). To generate the pET-3a-HPV-ll 
LI plasmid, the plasmid pET-3a-SV40Agno (4a) was digested with Ndel and 
BamWl, and the Nde\-Bgt\l fragment of HPV-11 LI subcloned from the PCR 
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The mutant LI coding sequences with amino-lerminal and carboxy-terminal 
deletions were generated with the cloning methods described above. Two pro- 
teins with amino-tcrminal deletions, NA2QL1 and NA61L1, were PCR amplified 
with two forward primers, 5'-GCGAAGCTTCATATGACCAACATATTTTAT 
CATGCC and 5'-GCGAAGCTTCATATGGTGTCTGGATATCAATATAGA, 
and a reverse primer as described above. Two mutant LI proteins with carboxy- 
terminal deletions, CA11L1 and CA86L1, were PCR amplified with a forward 
primer as described above and two reverse primers, 5'-CCTGGATCCAGATC 
TCATGTAGAGGGCTTAGACACAGC and 5'-CCTGGATCCAGATCTCAT 
CTATAAGTATCCTCCAGTGT. 

Recombinant LI expression and purification. LI proteins were expressed 
from the T7 promoter of pET-3a-HPV-l 1 LI in the host bacteria BL21(DE3). 
Bacteria were grown at 37°C in MQ medium with ampicillin (100 u.g/ml) over- 
night. Fresh medium (50 ml) was then inoculated with 1 ml nf [his overnight 
culture, and cells were grown for 6 to 8 h at 37°C. This culture was used to 
inoculate 500 ml of fresh medium containing 100 p.g of ampicillin per ml. 
Bacteria were induced with 1 mM isopropyl-fj-D-thiogalactopyranoside (IPTG) 
and grown for 6 h. The cells were then pelleted and stored at -20°C. 

Bacteria from 2 liters of culture were thawed in 200 ml of buffer A with 0.25 
M NaCI (50 mM Tris-HCl [pH 7.91, 5% glycerol, 2 mM EDTA, 15 mM 2-mer- 
caploethanol, 250 mM NaCI), and lysozyme was added to 200 u.g/ml. After 20 
min at 4°C, sodium deoxycholate and phenylmethylsulfonyl fluoride were added 
to final concentrations of 0.05% (wt/vol) and 1 mM, respectively. All subsequent 
steps were carried out at 4°C. The cell suspension was sonicated with a Heal 
System Ultrasonics W-225 for three 40-s bursts, at 1-min intervals. The sonicated 
suspension was subjected to Uounce homugenizalion 20 limes with a B pestle. 
The homogenate was centrifuged at 12,000 X g for 20 min, and Polymin P was 
added slowly to the supernatant to a final concentration of 0.5% (wt/vol) and 
incubated for 20 min. After cenlrifugation at 10,000 X g for 15 min, the pellet was 
resuspended in 50 ml of buffer B with 1 M NaCI (10 mM Tris-HCl [pH 7.9|, 5% 
glycerol, 2 mM EDTA, 15 mM 2-mcrcaptocthanol, 1 M NaCI) with a Douncc 
homogenizes The suspension was centrifuged at 10,000 X g for 15 min. The 
pellet was recxtracted with 50 ml of the same buffer, and the supernatants were 
combined and precipitated by the addition of ammonium sulfate to 35% satu- 
lim rhep 1 from the lirsl homogenate was extracted with 100 ml of buffer 

1 vith 1 M NaCI with a Douncc homogenizci flu uspension v si nlnfuged 
at 10,000 X g for 15 min, and the supernatant was precipitated with ammonium 

ulfaleasd bed a 1 prceipilal i uspend 20 1 buffer C 
with 0.1 M NaCI (10 mM Tris-HCl [pH 7.2], 5% glycerol 2 mM EDI \ 15 mM 

2 '< r a| methanol, 100 mM NaCI) and dialyzcd against the same buffer. The 
insoluble material was removed by ccntrifugalion at 10,000 X g for 20 min. The 
supernatant was subjected to chromatography on a DE 2i lulo olur in 
buff er C with 0.1 M NaCI. The flowthrough was applied to a PI 1 phosphocellu- 
losc (Whatman) column. The column was washed once with buffer C with 0.25 
M NaCI and once with buffer C with 0.5 M NaCI. LI was eluted with buffer C 
with 1 M NaCI. Further purification was carried out by pelleting the protein from 
sucrose gradients (see below) or directly from the 1 M NaCI eluant by cenlrif- 
ugation at 120,000 X g for 20 min. Purification steps were monitored by sodium 
dodccyl >ulfatc-polyacrylamidc gel electrophoresis (SDS-PAGE) with Coomas- 
sie blue staining and immunoblotting (23). 

Trypsin digestion and protein sequence determination. Purified L1 protein 
was dialyzcd against trypsin digestion buffer (10 mM Tris-HCl, 2 mM EDTA, 15 
mM 2-mercaploethanol, 100 mM NaCI, 100 mM NaHCOj, pH 7.8). Trypsin 
digestion was carried out for 2 h at 4, 25, or 37°C, with the ratio of protein to 
trypsin at 1:100 (wt/wl). Trypsin-digested 1.1 peptides were resolved by SDS- 
7.5% PAGE and electroblotted to an Immobilon-P membrane in CAPS buffer 
(10 mM 3-[cyclohexylamino]-l-propanesulfonic acid, 10% methanol, pH 11.0) 
{1 ). Proteins were stained with 0.1% Coomassic blue R-250 in 50% methanol for 
5 min and destained with 50% methanol-10% acetic acid with several changes. 
The LI proteolytic fragments were excised and subjected to protein sequencing. 
The amino-lerminal 8 amino acids of the trypsin products were determined with 
an Applied Biosyslcms gas phase sequencer. 

Southwestern assays. LI was separated by SDS-9% PAGE, transferred to 
nitrocellulose membranes, and then incubated for 20 min with 2% bovine serum 
albumin. Proteins were renatured in buffer D (50 mM Tris-HCl [pH 7.4], 1 mM 
EDTA, 200 mM NaCI, 10% glycerol, 0.1% Nonidet P-40) at 4°C overnight. The 
DNA probe was prepared from the plasmid SP65 digested with EcoRI and 
labeled with [ 32 P]dCTP, using the Prime-a-gene System (Promega). The DNA 
binding assay was carried out in buffer E (30 mM HEPES [pH 7.4], 5 mM MgCI 2 , 
50 mM NaCI) with "P-labeled plasmid DNA for 30 min at room temperature 
(19). The nitrocellulose filters were washed four times with binding buffer and 
then subjected to autoradiography. 

Sucrose gradient sedimentation of LI proteins. LI fractions eluling from the 
phosphocellulose column in buffer Cwilh I M NaCI were dialyzed against buffer 
A with 0.25 M NaCI. Four hundred microliters was layered onto a 5 to 20% 
sucrose gradient in buffer A with 0.25 M NaCI and centrifuged at 39,000 rpm for 
16 h in an SW41 rotor. Fractions (0.5 ml) were collected from below. The 
tractions were assayed for LI by immunoblotting or Southwestern assay follow- 
ing SDS-PAGE. The LI protein pelleted in the sucrose gradients was near 

Electron microscopy. LI protein obtained after the phosphocellulose purifi- 
cation step was dialyzed into buffer containing 0.1 M NaCI, 20 mM Tris-HCl (pH 



7.2), 1 mM EDTA, and 15 mM 2-mercaptoelhanol. Samples were applied to 
glow-discharged carbon-coated 400-mesh grids and stained with 2% uranyl ace- 
tate. Images were photographed with a Philips CM 10 electron microscope at 
nominal magnifications of X39.000 or X73.000. 

Enzyme-linked immunosorbent assay (ELISA). VLPs were purified as de- 
scribed previously (22, 23). Methods used for the production of rabbit polyclonal 
HPV-11 virion and HPV-11, HPV-16, and HPV-18 VLP antisera have been 
described previously (4, 22, 24). Purified antigens were diluted in phosphate- 
buffered saline to a concentration of 0.01 mg/ml, and 100-u.l aliquots were 
dispensed into 96-wcll microliter plates. The plates were incubated overnight at 
4°C and then blocked for 2 h at room temperature, as previously described (24). 

Threefold serial dilutions of primary antisera were made in a diluent solution 
containing 1% bovine serum albumin (Kirkegaard & Perry Laboratories, Inc., 
Gaithersburg, Md.) and 10% (vol/vol) wild-type Auiographa califomica nuclear 
polyhcdrosis virus-infected cell culture supernatant, added to reduce background 
' tivitics (22) Diluted sera were preabsorbed overnight at 4°C, centri- 
fuged for 1 min at 14,000 X s , and then added in 100-u.l aliquots to replicate wells 
containing antigen (tesl wells) or phosphate-buffered saline alone (control wells). 
Secondary antibody conjugate (anti-rabbit immunoglobulin G-alkaline phospha- 
tase at a 1:2,000 dilution; Kirkegaard & Perry Laboratories) and enzyme sub- 
strate were then added as previously described (24). Specific absorbancc values 
were calculated by subtracting control well values from lest well values, and 
replicate differences were averaged. 



RESULTS 

Purification of the HPV-11 LI protein after expression in E. 

coli. The full-length HPV-11 LI coding sequence was cloned 
into the vector pET-3a to yield pET3aLl, and protein expres- 
sion was induced from the T7 promoter of the vector by addi- 
tion of IPTG. LI was expressed more efficiently in the E. coli 
strain BL21(DE3) than in HMS174(DE3) (data not shown). 
The LI protein was purified according to a protocol based on 
that used for the purification of the polyomavirus VP1 protein 
(14). The steps in the purification were assayed by SDS-PAGE 
and immunoblot analysis with an anti-Ll rabbit antiserum (Fig. 
1) (23). After cell lysis, approximately 50% of the LI was 
soluble. LI remaining in the bacterial cell pellet at this stage 
could be partially recovered by further extraction in a buffer 
containing 1 M NaCI. Under the loading buffer conditions, LI 
was found in the flowthrough fraction from the DE-52 cellu- 
lose column (Fig. 1, lanes 3) and was retained on the Pll 
phosphocellulose column (Fig. 1, lanes 4 and 5). LI eluted in 
1 M NaCI buffer from the phosphocellulose column (Fig. 1, 
lanes 6). This chromatographic behavior is similar to that of 
the polyomavirus VP1 protein (14). The eluant from the phos- 
phocellulose column contained various amounts of minor con- 
taminating proteins, and final purification was achieved with 
sedimentation through a sucrose gradient (Fig. 1, lanes 6 and 

LI binds DNA. Analysis of LI DNA binding was performed 
with the Southwestern blot assay (19). In order to characterize 
the DNA-binding domain identified by this assay, LI proteins 
with amino- and carboxy-terminal deletions were generated by 
site-directed mutagenesis of the LI expression vector. Mutant 
proteins were partially purified after expression in bacteria. 
The LI proteins were separated by SDS-PAGE, electrotrans- 
ferred to nitrocellulose, and hybridized with 32 P-labeled plas- 
mid DNA. As seen in Fig. 2, DNA binding similar to that of the 
full-length protein was seen for two mutant proteins with ami- 
no-terminal deletions, NA29L1 and NA61L1. However, two 
mutant proteins with carboxy-terminal deletions, CA86L1 and 
CA11L1, failed to bind DNA. DNA binding to the full-length 
LI protein was also tested with increasing concentrations of 
NaCI in the Southwestern blot assay, and DNA binding was 
lost when greater than 200 mM NaCI was present in the wash 
buffer (data not shown). 

The mutant CA11L1 also behaved differently from the full- 
length LI protein during its purification, in that CA11L1 eluted 
from the phosphocellulose column in a buffer containing 0.5 M 
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FIG !. Purification ol HI'V-II LI aflci expression in K. coli. SDS-PAGE analysis of LI fractions either slained wilh Coomassie blue (A) or immunoblolled with 
an anli LI inliscrum (B). Lanes: 1, whol ! l t i [ r ipitate 1 llowlhrough from DE-52 column; 4, llowlhrough from phosphocellulose 

column; 5, 250 mM NaCI cluani from phosphocellulose; 6, 1 M NaCI eluanl from phosphocellulose; 7, LI pelleted after sucrose gradient sedimentation. Molecular mass 
standards (in kilodallons) appear on the left. 



NaCI. For the polyomavirus VP1 protein, disulfide reducing 
agents eliminated DNA binding in the Southwestern assay 
(19). However, neither 2-mercaptoethanol nor dithiothreitol 
affected LI DNA binding (data not shown). A DNA-binding 
domain therefore can be localized to the carboxy-terminal 11 
amino acids of LI. 

Trypsinization of recombinant LI yields a discrete 42-kDa 
fragment. Protease sensitivity of viral capsid proteins has been 




FIG. 2. Southwestern analysis of LI mutant proteins. Recombinant LI pro- 
teins were partially purified after expression in E. coli, subjected to SDS-PAGE, 
and either immunoblotted with an anti-Ll antiserum (A) or subjected to South- 
western analysis (B). Lanes: 1, wild type; 2, NA29L1; 3, NA61L1; 4, CA11L1; 5, 
CA86L1. 



informative in assessing their native conformation, and for 
example, trypsin cleaves the polyomavirus VP1 protein at a 
single lysine residue (K28) near the amino terminus (19). In 
order to characterize protease-accessible domains of LI, the 
purified recombinant LI protein was subjected to trypsin di- 
gestion. After digestion with trypsin, two progressive cleavage 
products were identified by SDS-PAGE (Fig. 3A). The first 
(LI a) was an =48-kDa species obtained after digestion at 
either 4 or 25°C. The second was an =»42-kDa species (trpLl) 
which was obtained after digestion at 37°C for 2 h (Fig. 3A, 
lane 4) and which remained unchanged upon longer incuba- 
tions. These cleavage products were characterized by amino- 
terminal protein sequencing after electrotransfer to an Immo- 
bilon filter. Both products had an intact amino terminus, 
indicating cleavage sites at the carboxy terminus. When di- 
rectly compared, CA86L1 (Fig. 2A, lane 5) (calculated molec- 
ular mass, 45.7 kDa) migrated at the same apparent molecular 
weight by SDS-PAGE as trpLl (Fig. 3A, lane 4), suggesting 
that R415 is the cleavage site used to generate trpLl. Based 
upon its SDS-PAGE mobility in relation to the full-length and 
trpLl proteins, the Lla cleavage site is likely to be at residue 
K450, K460, or R471. The resistance of the amino terminus to 
digestion suggests that this domain of LI is relatively inacces- 
sible to digestion in comparison to the carboxy terminus. 

In order to confirm the results of the previous DNA binding 
experiments, the trypsin-digested LI cleavage products were 
analyzed with the Southwestern assay (Fig. 3C). Consistent 
with the CA86L1 and CA11L1 results, both trypsin cleavage 
products did not bind DNA in this assay. In addition, both 
trpLl and Lla eluted from phosphocellulose in buffers con- 
taining 100 mM NaCI (data not shown), again correlating de- 
creased phosphocellulose binding with deletion of the putative 
LI DNA-binding domain. 

Sedimentation analysis identifies inter- and intrapentamer 
bonding domains. Sedimentation analysis has been used to 
separate polyomavirus subviral intermediates as well as to as- 
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FIG. 3. Trypsin digestion of LI. LI protein was partially purified after expression in E. coli; digested for 2 h with trypsin at the indicated temperatures; subjected 
to SDS-PAGE; and stained with Coomassie blue (A), immunoblotted with an anti-Ll antiserum (B), or subjected to Southwestern analysis (C). Lanes: 1, no trypsin 
at 37°C; 2 to 4, trpsin at 4, 20, and 37°C, respectively. Lla and trpL1 indicate major proteolytic products. 



sess capsomere and capsid formation (9). For polyomavirus, 
the pentameric VP1 capsomere sediments at 7.5S, and the 
capsid sediments at 140S (25). To assay the effect of LI mu- 
tations on capsomere and capsid formation, velocity sedimen- 
tation analysis of full-length, trypsinized, and mutant LI pro- 
teins was performed. Given the molecular weight of LI, 
pentamers of =*270 kDa might be expected to sediment at 10 
to 14S. Under the conditions shown in Fig. 4, capsid-like par- 
ticles and aggregates having a sedimentation value greater than 
19S were pelleted to the bottom of the gradients. The associ- 
ated nature of the full-length LI protein was made evident by 
the small amount of protein sedimenting between 7 and 14S 
compared with the amount present in the pellet fraction. The 



majority of the CA11L1 protein also pelleted, suggesting capsid 
or aggregate formation in this preparation (data not shown). 
The trpLl protein showed a major component with a sedimen- 
tation coefficient of 11 to 12S, a value consistent with that 
expected for unassociated pentamers. 

The mutant proteins with amino-terminal deletions, 
NA29L1 and NA61L1, precipitated during the dialysis before 
the DE-52 column, and only a small amount of soluble mutant 
protein was obtained. Therefore, their presence in the sucrose 
gradients was monitored by SDS-PAGE and with the South- 
western assay. Both proteins sedimented with a predominant 
species at 4S (Fig. 4) (data for NA61L1 are not shown). Fur- 
thermore, a Centricon-100 spin column (Amicon) with a 100- 




trpL1> 



N29L1 ^ -mm* mm W 

FIG. 4. Sucrose gradient sedimentation of LI proteins. SDS-PAGE analysis of fractions after sedimentation of LI proteins in 5 to 20% sucrose gradients. Proteins 
were delected by immunoblotting (LI and trpLl) or Southwestern assay (NA29L1). Sedimentation markers in a parallel gradient were p-galaclosidasc (19S), catalase 
(11. 3S), and hemoglobin (4.5S). sm, starting material. 
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kDa exclusion limit was used to concentrate the mutant 
NA29L1 and NA61L1 proteins. Under these conditions, pen- 
tameric capsomeres should be retained. Immunoblot analysis 
showed that the mutant proteins passed through the Centri- 
con-100 (data not shown), providing additional evidence that 
these truncations at the amino terminus affected pentamer 
formation and resulted in an LI protein species of approxi- 
mately monomer-to-dimer size. 

Recombinant LI self-assembles in vitro into capsid-like par- 
ticles. HPV-11 LI protein purified after recombinant baculo- 
virus expression in Sf9 insect cells self-assembles into VLPs 
(23). Based upon this result, LI protein purified after expres- 
sion in E. coli was analyzed for in vitro capsid assembly by 
electron microscopy. As shown in Fig. 5A, the LI protein, 
which eluted from the phosphocellulose column in 1 M NaCl, 
self-assembled into capsid-like structures with a diameter sim- 
ilar to that of papillomavirus virions (55 nm). In addition, 
unassembled capsomeres with a central stain-filled hollow, 
which appeared similar to the pentameric capsomeres of poly- 
omavirus VP1, could also be identified (25). Electron micro- 
graphs of the trypsin digestion products, Lla and trpLl, 
showed a uniform population of pentamer-like structures (Fig. 
5B and C), although many of the pentamers appeared to lie on 
their sides, presenting an H or V shape. Capsid-like structures 
were no longer present after trypsin digestion of LI. Higher- 
magnification views (Fig. 5C) verified the similarity of these 
structures to truncated pentamers of the polyomavirus VP1 
protein (8). The CA11L1 protein was further purified by su- 
crose gradient sedimentation in order to obtain a homoge- 
neous preparation suitable for electron microscopy. Electron 
micrographs demonstrated that CA11L1 is also capable of in 
vitro self-assembly into capsid-like aggregates, although they 
are of less uniform appearance than those of the full-length LI 
protein (Fig. 5D). Because of the difficulty in purifying the 
CA86L1 protein, this mutant protein could not be analyzed by 
electron microscopy. 

In order to test whether LI assembly occurred in vivo, bac- 
teria expressing LI were fixed, sectioned, and directly exam- 
ined by electron microscopy. No capsid-like particles were seen 
in the bacteria, although inclusion bodies were present (data 
not shown). In addition, crude bacterial lysates were subjected 
to sucrose gradient sedimentation, and fractions were assayed 
both by immunoblotting for LI and by electron microscopy. 
Only rare capsid-like particles could be detected in the pres- 
ence of abundant LI protein (data not shown). Thus, capsid 
assembly occurs after bacterial lysis and is likely facilitated by 
the exposure of the LI protein to high ionic strength, e.g., the 
phosphocellulose column elution buffer, or to other in vitro 
conditions which may favor capsid formation. 

Immunologic reactivity of bacterially expressed HPV-U LI. 
Results from previous studies have demonstrated that VLPs 
have conformationally dependent capsid-neutralizing antigenic 
domains similar to those of native HPV-11 virions (24). In 
addition, these epitopes are antigenically distinct from similar 
epitopes of HPV-16 and HPV-18 (22). The antigenic proper- 
ties of HPV-11 capsids and isolated capsomeres obtained after 
expression in E. coli were compared with VLPs in an ELISA 
(Fig. 6, top panel) (4, 24). Antisera raised against either native 
HPV-11 virions (4) or HPV-11 VLPs (24) reacted indistin- 
guishably with all of the HPV-11 LI antigens tested. Dena- 
tured VLPs reacted much less efficiently in this assay (data not 
shown), indicating that nonlinear epitopes were the predomi- 
nant epitopes recognized by these antisera. Virus type speci- 
ficity of the epitopes was verified by the failure of antibodies 
raised against either HPV-16 or HPV-18 capsids to react with 
the HPV-1 1 antigens (Fig. 6, bottom panel). These findings are 



consistent with previous results (22) and indicate that HPV-11 
capsids and LI capsomeres purified after expression in E. coli 
are antigenically similar to HPV-11 VLPs and, thus, to native 
HPV-11 virions. 



DISCUSSION 

We have expressed and purified the HPV-11 LI protein in E. 
coli. The purified protein is soluble and has many of the bio- 
chemical properties found previously for the recombinant 
polyomavirus VP1 protein, such as DNA binding, pentameric 
morphology, and the ability to self-assemble into capsid-like 
particles. The introduction of site-directed mutations into the 
expression vector has allowed identification of three functional 
domains of the LI protein: (i) a DNA-binding region near the 
carboxy terminus, (ii) a region within 86 amino acids of the 
carboxy terminus required for intercapsomere bonding, and 
(iii) a region near the amino terminus which affects pentamer 
formation. The positions of these domains distinguish HPV-11 
LI from polyomavirus VP1, in that the VP1 DNA-binding 
domain is located at the amino terminus and the intercapsom- 
ere bonding domain is at the carboxy terminus. This difference 
suggests an alternative bonding strategy for LI in the papillo- 
mavirus capsid compared to that of VP1 in the polyomavirus 
capsid. 

Digestion with trypsin was used to identify accessible resi- 
dues and thus provide evidence for conformationally exposed 
regions of the LI pentamer. Characterizations of papillomavi- 
rus structural proteins derived from purified virions by SDS- 
PAGE have previously noted minor capsid protein species 
which maybe degradation products related to those generated 
by trypsin (6, 7). Interestingly, trypsin cleavage sites were 
found only at the carboxy terminus, extending inward approx- 
imately 86 residues. The inability of the resulting trpLl pen- 
tamers to self-assemble while maintaining their pentameric 
morphology is similar to the behavior of the polyomavirus 
ANCO VP1 mutant protein (8). This result provides evidence 
that the intercapsomere linker arm of LI is located at its 
carboxy terminus. The PPP motif around residue 404 (Fig. 6) 
may be similar to the PYP motif in polyomavirus (residue 320) 
and SV40 (residue 300) (15). These residues signal the exit of 
the polypeptide chain from the body of the pentamer and its 
redirection towards its capsomere neighbor, and therefore the 
PPP motif may represent the beginning of an interpentamer 
linker arm for LI. The ability of the CA11L1 mutant protein to 
assemble may therefore define the linker arm as the domain 
between LI residues 407 and 490. 

The sedimentation characteristics of the mutant and 
trypsinized LI proteins were consistent with the morphology of 
the proteins determined by electron microscopy. The trpLl 
protein is morphologically similar to the polyomavirus VP1 
pentameric capsomere, and the 12S value is therefore repre- 
sentative of an LI capsomere. The proteins with amino-termi- 
nal deletions, NA29L1 and NA61L1, sediment at less than 
capsomere size, suggesting that pentamer oligomerization is 
disrupted by these deletions. The motif PPP, residues 12 to 14 
(Fig. 7A), may indicate a chain redirection at the amino ter- 
minus of LI, and it is possible that deletions before this signal 
may not affect pentamer formation. 

With the Southwestern assay, a DNA-binding domain was 
localized to the carboxy-terminal 11 residues of HPV-11 LI 
(Fig. 2). We assume that this DNA binding is sequence non- 
specific, because a plasmid probe was used for its identification 
and because similar results for the polyomavirus VP1 protein 
have indicated no preference for the source of DNA (19). A 
previous reported study of the HPV-16 LI protein in which our 
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FIG. 6. Antigenic properties of the HPV-1 1 LI protein purified after expres- 
sion in E. coli. (Top panel) Polyclonal rabbit antisera againsl HPV-11 VLPs 
(filled symbols) or HPV-11 virions (open symbols) were compared in an ELISA 
for reactivity againsl VLPs (triangles), capsids prepared from L1 purified after 
expression in E. coli (squares), or trpLl capsomcres (diamonds). (Bottom panel) 
Type-specific polyclonal HPV capsid antisera were tested for immunoreaclivity 
in an ELTSA againsl VLPs (white bars), trpLl capsomeres (grey bars), or capsids 
assembled from LI purified after expression in E. coli (black bars). Tested were 
HPV-11 virion antiserum (A), HPV-11 VLP antiserum (B), HPV-16 VLP anti- 
serum (C), and HPV-1 S VLP antiserum (D). 



protocol was used found no DNA binding activity for this 
protein, although the HPV-16 LI protein has a similar basic 
region at its carboxy terminus (31). However, fusion proteins 
for HPV-16 and 6b LI proteins have demonstrated DNA bind- 
ing (16); thus, with the current data we believe that LI is a 
DNA-binding protein. However, this activity may be modu- 
lated if associated with L2 in a manner similar to VP1 associ- 
ation with VP2/VP3 (5). A nuclear localization signal (NLS) 
sequence was also identified in this region of the HPV-16 LI 
protein (29). By inspection, it is likely that a similar NLS 
sequence may be present in the HPV-11 LI protein as well 
(Fig. 7). By analogy to polyomavirus VP1, the DNA binding 
and NLS sequences may overlap, and further site-directed 
mutagenesis will be required to identify residues important for 
each function (18, 19). The biological significance of this DNA 
binding property also requires further investigation. 
The assembly of capsid-like aggregates of HPV-11 LI par- 



A 



MWRPSDSTVY-VPPPNPVSKV-VATDAYVKRT-NIFYHASSSR-LLAVGHPYYS- 

G0 70 80 90 100 

IKKVNKTWP-KVSGYO.YRVF-KWLPDPNKF-ALPDSSLFDP-TTQRLVWACT- 



410 420 430 440 450 

GLSPPPNGTL-EDTYRYVQSCi-AITCQKPTPE-KEKQDPYKDM-SFWEVNLKEK- 

460 470 480 490 500 

FSSELDQFPL-GRKFLLQSGY-RGRTSARTGI-KRPAVSKPST-APKRKRTKTK-K 




FIG. 7. Summary of LI proteins analyzed. The LI amino- and carboxy- 
n i I t i t i i ) hown al ith the truncated LI protein 

(B) tested for DNA binding and capsid self-assembly. A consensus bipartite NLS 
is present at residues 480 to 481 through 492 to 495. 



allels the behavior of this protein when expressed in insect cells 
with baculovirus vectors (23). However, the VLPs purified 
from the nucleus of the insect cells were more homogeneous in 
appearance than those generated from in vitro assembly of the 
E. coli recombinant LI protein. This behavior was previously 
noted for the polyomavirus VP1 protein and has been attrib- 
uted to more ideal assembly conditions present intracellularly 
than in vitro (17). In contrast to polyomavirus VP1, the in 
vitro-assembled LI is very difficult to disassemble under con- 
ditions of calcium chelation and disulfide bond reduction 
(14a), and by sucrose gradient analysis the majority of the 
protein was in an aggregated form. Only with concurrent tryp- 
sin digestion could capsid-like aggregates be fully disrupted, 
again suggesting differences in interpentamer bonding between 
LI and polyomavirus VP1. The recent observation of "holes" 
in a subset of BPV capsids suggests an alternative mechanism 
of capsid dissociation (3). Calcium chelation and/or disulfide 
bond reduction may cause a conformational change in the 
capsid, resulting in the opening of holes between capsomeres. 
The carboxy termini of the LI proteins may then become 
exposed to a trypsin-like protease, which cleaves LI, resulting 
in irreversible capsid disassembly. This type of conformational 
change with subsequent sensitivity to proteases has been well 
described for plant viruses (11, 13, 20) and may be utilized 
during virus uncoating after infection. Without induction of 
such a conformational change, however, BPV virions are re- 
sistant to trypsin and other proteases (21). 

The ELISA analysis suggests that HPV-11 LI purified after 
expression in E. coli is antigenically similar to VLPs and thus to 
native HPV-11 virions. The results obtained with the trpLl 
protein also demonstrate that the unassembled capsomere sub- 
unit retains the overall antigenicity of VLPs and capsids, since 
the reactivities with polyclonal antisera were indistinguishable 
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between capsomeres (trpLI), capsids, and VLPs over the ti- 
tration range of the ELISA. Previous attempts to generate 
native immunoreactivity with LI proteins purified after expres- 
sion in E. coli may have been unsuccessful because these pro- 
teins were denatured. Although the ability of individual cap- 
someres to induce HPV-11 virus-neutralizing antibodies 
remains to be tested, these results suggest that LI capsomeres 
may possess a neutralizing conformational domain, character- 
ized previously with the same antisera (4, 22, 24). 

The ability to purify the HPV-11 LI protein in a soluble, 
nondenatured form after expression in E. coli now permits a 
detailed analysis of the intercapsomere bonding domain and 
buffer conditions which favor papillomavirus capsid assembly. 
Further studies will be required to test whether the recombi- 
nant HPV-11 capsomeres can generate a neutralizing-antibody 
response which is biologically significant. 
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